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5 CLONED PLANT P - HYDROXYPHENYL PYRUVIC ACID DIOXYGENASE 

This invention was made with Government support 
under Grant Number 93373069083 awarded by the U.S. 
Department of Agriculture. The Government has certain 
rights in the invention. 

10 Field Of The Invention 

The present invention relates to a molecular 
approach for modifying the synthesis of vitamin E, 
plastoquinone, and carotenoids in plants by use of a 
full-length cloned cDNA which encodes a p- 

15 hydroxyphenyl pyruvic acid dioxygenase enzyme. 

Background Of The Invention 
The chloroplasts of higher plants contain many 
unique, interconnected biochemical pathways that 
produce an array of secondary metabolite compounds 

20 which not only perform vital functions within the 
plant but are also important from agricultural and 
nutritional perspectives. Three such secondary 
metabolites are the lipid soluble, chloroplastically 
synthesized compounds vitamin E (a- tocopherol or 

.25 of-toc) , plastoquinones (PQ) , and carotenoids, which 

together perform many crucial biochemical functions in 
the chloroplast . PQ and vitamin E are quinone 
compounds synthesized by a common pathway in the 
plastid; carotenoids are tetraterpenoids synthesized 

30 by a separate plastid-localized pathway. 

Plastoquinone (PQ) often accounts for up to 50% 
of the total plastidic quinone pool in green tissues. 
The primary function of PQ is as a fundamental 
component of the photosynthetic electron transport 

35 chain, acting as an electron carrier between 
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photosystem II and the cytochrome b 6 f complex. PQ 
likely has other less well studied functions in 
plastids, namely in acting as a direct or intermediate 
electron carrier for a variety of other biosynthetic 
reactions in the chloroplast . 

Vitamin E is the second major class of 
chloroplastic quinones, accounting for up to 40% of 
the quinone pool in plastids. The essential 
nutritional value of tocopherols was recognized around 
1925, and the compound responsible for Vitamin E 
activity was first identified as or- tocopherol in 1936. 
a-Toc has a well -documented role in mammals as an 
antioxidant, and a similar, though less well 
understood antioxidant role in plants. Liebler, et 
al., Toxicolocrv 23:147-169, 1993 ; Hess, Ant i -oxidants 
in Higher Plants . CRC Press: 111-134, 1993. 

Carotenoids are a separate, diverse group of 
lipophilic pigments synthesized in plants, fungi, and 
bacteria. In photosynthet ic tissues, carotenoids 
function as accessory pigments in light harvesting and 
play important roles in photo-protection by quenching 
free radicals, singlet oxygen, and other reactive 
species. Siefermann-Harms /(S Phvsiol . Plantarum . 69:561- 
568, 1987. In the plastids of non-photosynthet ic 
tissues, high levels of carotenoids often accumulate 
providing the intense orange, yellow, and red 
coloration of many fruits, vegetables, and flowers 
(Pfander, Methods in Enzvm. . 213A, 3-13, 1992). In 
addition to their many functions in plants, 
carotenoids and their metabolites also have important 
functions in animals, where they serve as the major 
source of Vitamin A (retinol) , and have been 
identified as providing protection from some forms of 
cancer due to their antioxidant activities. Vitamin 
E's antioxidant activities are also thought to protect 
against some forms* of cancer, and may act 
synergistically with carotenoids in this regard. 
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Liebler, et al . , Toxicology 23:147-1^^^ 1993; Krinsky, 
J. Nutr. 119:123-126, 1989. 

Tocopherol and Plastoguinone Synthesis 

a-Tocopherol and plastoquinone are the most 
5 abundant quinones in the plastid and are synthesized 
by the common pathway shown in Figure 1 . The 
precursor molecule for both compounds, homogentisic 
acid (HGA) , is produced in the chloroplast from the 
shikimic acid pathway intermediate p-hydroxyphenyl 
10 pyruvic acid (pOHPP) , in an oxidat ion/decarboxylat ion 
reaction catalyzed by the enzyme p-hydroxyphenyl 
pyruvic acid dioxygenase (pOHPP dioxygenase) . 
Homogentisic acid is subject to 

phytylation/prenylation (phytylpyrophosphate and 
15 solanylpyrophosphate , C 20 and C 45 , respectively) coupled 
to a simultaneous decarboxylation by a phytyl/prenyl 
transferase to form the first true tocopherol and 
plastoquinone intermediates, 
2 -dernethylphytylplastoquinol and 
20 2-demethylplastoquinol-9 , respectively. A single ring 
methylation occurs on 2 -demethylplastoquinol to yield 
plastoquinol-9 (PQH 2 ) which is then oxidized to 
plastoquinone- 9 (PQ) . This oxidation is reversible 
and is the basis of electron transport by 
25 plastoquinone in the chloroplast. 

The preferred route, as established in spinach, 
for a-tocopherol formation from 

2 -dernethylphytylplastoquinol appears to be 1) ring 
methylation of the intermediate, 2-a- 

30 dernethylphytylplastoquinol, to yield 

phytylplastoquinol , 2) cyclization to yield 
d- tocopherol and, finally, 3) a second ring 
methylation to yield a- tocopherol . Ring methylation 
in both tocopherol and plastoquinone synthesis is 

35 carried out by a single enzyme that is specific for 
the site of methylation on the ring, but has 
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relatively broad substrate specificity and 
accommodates both classes of quinone compounds. This 
methylation enzyme is the only enzyme of the pathway 
that has been purified from plants to date. 
d'Harlingue, et al., J. Biol . Chem. 26:15200, 1985. All 
enzymatic activities of the a-toc/PQ pathway have been 
localized to the inner chloroplast envelope by cell 
fractionation studies except for pOHPP dioxygenase and 
the tocopherol cyclase enzyme. Difficulties with cell 
fractionation methods, low activities for some of the 
enzymes, substrate stability and availability and 
assay problems, make studying the pathway 
biochemically difficult . 

Vitamin E and PQ levels, ratios, and total 
amounts vary by orders of magnitude in different 
plants, tissues and developmental stages. Such 
variations indicate that the vitamin E and PQ pathway 
is both highly regulated and has the potential for 
manipulation to modify the absolute levels and ratios 
of the two end products. The pathway in Figure 1 
makes it clear that production of homogentisic acid by 
pOHPP dioxygenase is likely to be a key regulatory 
point for bulk flow through the pathway, both because 
HGA production is the first committed step in cx-toc/PQ 
synthesis, and also because the reaction is 
essentially irreversible. Therefore modifying the 
levels of HGA by modifying pOHPP dioxygenase activity 
should have a direct impact on the total a-toc/PQ 
biosynthetic accumulation in plant tissues, and, as 
described below, because of the connection of PQ and 
carotenoid synthesis, should also affect carotenoid 
synthesis in plant tissues. 

Carotenoid Biosynthesis; Quinones as Electron Carriers 

In plants, carotenoids are synthesized and 
accumulate exclusively in plastids via the pathway 
shown on the left-hand side of Figure 1. The first 
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in carotenoid synthesi^is the 
condensation of two molecules of the C 20 hydrocarbon 
geranylgeranyl pyrophosphate (GGDP) by the enzyme 
phytoene synthase, to form the colorless C 40 
5 hydrocarbon, phytoene. In oxygenic photosynthet ic 
organisms (e.g. plants, algae, and cyanobacteria) , 
phytoene undergoes two sequential desaturation 
reactions, catalyzed by phytoene desaturase, to 
produce f- carotene through the intermediate 

10 phytofluene. Subsequently, {"-carotene undergoes two 
further desaturat ions , catalyzed by f -carotene 
desaturase, to yield the red pigment lycopene . 
Lycopene is cyclized to produce either a-carotene or 
/3-carotene, both of which are subject to various 

15 hydroxylation and epoxidation reactions to yield the 
carotenoids and xanthophylls most abundant in 
photosynthetic tissues of plants, lutein, /?-carotene, 
violaxanthin and neoxanthin. 

The genes encoding the first two enzymes of the 

20 carotenoid pathway (phytoene synthase and phytoene 
desaturase) have been isolated and studied from a 
number of plant and bacterial sources in recent years. 
Sandmann, Eur. J. Biochem. 223:7-24, 1994. Phytoene 
desaturase has been the most intensively studied, both 

25 because it is a target for numerous commercially 

important herbicides , and also because the phytoene 
desaturation reaction is thought to be a rate limiting 
step in carotenoid synthesis. Molecular and 
biochemical studies suggest that two types of phytoene 

3 0 desaturase enzymes have evolved by independent 
evolution: the crtl-type found in anoxygenic 
photosynthetic organisms (e.g. Rhodobacter and 
Erwinia) , and the pds-type found in oxygenic 
photosynthetic organisms. Despite their differences 

3 5 in primary amino acid sequence, all phytoene 

desaturase enzymes contain a dinucleotide binding 
domain (FAD or NAD/NADP) , which in Capsicum annum has 
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been shown to be FAD. Hugueney et al . , Eur. J. 
Biochem. 209:399-407, 1992. Presumably, the bound 
dinucleotide in both types of phytoene desaturase 
enzymes is reduced during desaturation and reoxidized 
by an unknown reductant present in the plastid or 
bacterium. 

Several lines of evidence have suggested a role 
for quinones in the phytoene desaturation reaction in 
higher plants. Using isolated daffodil chromoplasts, 
Mayer and co-workers demonstrated that in an anaerobic 
environment, oxidized artificial quinones were 
required for the desaturation of phytoene while 
reduced quinones were ineffective. Mayer et al . , Eur . 
J. Biochem. 191:359-363, 1990. Further supporting 
evidence comes from studies with the triketone class 
of herbicides (e.g. Sulcotrione) , which cause phytoene 
accumulation in treated tissues but unlike the well- 
studied pyridazone class (e.g. Norflorazon (NFZ) ) do 
not directly affect the phytoene desaturase enzyme . 
Rather, triketone herbicides competitively inhibit 
pOHPP dioxygenase, an enzyme common to the synthesis 
of both plastoquinone and tocopherols, suggesting that 
one or more classes of quinones may play a role in 
carotenoid desaturation reactions. Schulz et al., 
FEBS 318:162-166, 1993/ Secor, Plant Physiol . 106: 
1429-1433; Beyer et al . , IUPAC Pure and Applied 
Chemistry 66 : 1047- 1056 , 1994 . 

Despite the well-studied, wide-spread importance 
of vitamin E, plastoquinone, and carotenoids to human 
nutrition, agriculture, and biochemical processes 
within plant cells, much remains unclear about their 
biosynthesis and accumulation in plant tissues. This 
uncertainty has in turn limited the potential for 
manipulation of the synthesis and levels of these 
important compounds in plants . 
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Summary of the Invention 
In one embodiment, this invention provides a 
biologically pure sample of DNA which DNA comprises a 
sequence coding for the expression of Arabidopsis 
5 thaliana p- hydroxyphenyl pyruvic acid dioxygenase. 

In other embodiments, this invention provides a 
vector and microbial host containing a DNA sequence 
sufficiently homologous to SEQ ID NO : 1 so as to code 
for the expression of Arabidopsis thaliana p- 
10 hydroxyphenyl pyruvic acid dioxygenase, and a genetic 
construct containing a DNA sequence sufficiently 
homologous to SEQ ID NO : 1 so as to code for the 
expression of Arabidopsis thaliana p- hydroxyphenyl 
pyruvic acid dioxygenase, together with a. promoter 
15 located 5' to the DNA coding sequence and a 3' 
termination sequence. 

In another embodiment, this invention provides a 
method of creating a transgenic plant in which the 
levels of the pOHPP dioxygenase enzyme are elevated. 
20 sufficient such that production of plastoquinones , 
vitamin E, and carotenoids are modified. 

It is an object of the present invention to 
genetically engineer higher plants to modify the 
production of plastoquinones, vitamin E, and 
25 carotenoids. 

It is another object of the invention to provide 
transgenic plants that would express elevated levels 
of the pOHPP dioxygenase enzyme which would have 
resultant elevated resistance to the triketone class 
30 of herbicides (i.e. sulcotrione) . 

It is another object of the present invention to 
provide a method for the preparation of the enzyme p- 
hydroxyphenyl pyruvic acid dioxygenase (pOHPP 
dioxygenase) , an enzyme which can be used to identify 
35 new pOHPPdioxygenase- inhibiting herbicides. 

Other features and advantages of the invention 
will be apparent from the following description of the 
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preferred embodiments thereof and from the claims. 



Brief Description of the Drawings 
Fig. 1 is a diagram of the pathways for synthesis 
of carotenoids, vitamin E (tocopherol), and 
5 plastoquinone . 

Fig. 2 is a diagram of the interconnections of 
the pathways illustrated in Fig. 1. 

Fig. 3A-3E are graphs of pigment analyses of 
wild- type, NFZ-wt, and pdsl tissues. 
10 Fig. 4 is a physical map of the pdsl mutation 

relative to visible markers. 

Figs. 5A-5C present the results of C18 HPLC 
separation of lipid soluble pigments from wild-type 
plants on MS2 media, homozygous pdsl mutants on MS2 
15 media supplemented with pOHPP, and homozygous pdsl 

mutants on MS2 media supplemented with homogentist ic 
acid (HGA) . 

Figs. 6A-6B present the results of C8 HPLC 
analyses of quinones in NFZ-wt and pdsl tissues. 

20 Detailed Description Of The Invention 

As described above, both Vitamin E, 
plastoquinones and carotenoids are synthesized and 
accumulated in plastids by the pathways shown in 
Figure 1. This specification describes the 

25 identification, isolation, characterization and 
functional analysis of a higher plant pOHPP 
dioxygenase cDNA, its role in a- toe, PQ and carotenoid 
synthesis, and the use of this cDNA to modify pOHPP 
dioxygenase activity in plant tissues and hence the 

3 0 accumulation of one or more of the compounds 

plastoquinones, vitamin E, and carotenoids in plant 
tissues. The overexpression of pOHPP dioxygenase in 
transgenic plants will modify the enzyme- to- inhibitor 
ratio of plant tissues exposed to triketone 

35 herbicides, as compared to non- transgenic plants, 
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resulting xn increased herbicide resistance. The 
present specification also describes a genetic 
construct for use in the production of pOHPP 
dioxygenase, an enzyme useful in identifying new pOHPP 
dioxygenase- inhibiting herbicides . 

By genetic analysis the present inventors have 
shown that the vitamin E, plastoquinone , and 
carotenoid biosynthetic pathways are interconnected 
and share common elements as shown in Figure 2 . From 
mutational studies in Arabidopsis thaliana, the 
present inventors identified one genetic locus, 
designated pdsl (pds= phytoene desaturat ion) , the 
disruption of which results in accumulation of the 
first carotenoid of the carotenoid biosynthetic 
pathway, phytoene . Surprisingly, though this mutation 
disrupts carotenoid synthesis and was originally 
identified on this basis, it does not map to the locus 
encoding the phytoene desaturase enzyme. Evidence 
indicates that pdsl defines a second gene product in 
addition to the phytoene desaturase enzyme, necessary 
for phytoene desaturation and hence carotenoid 
synthesis in higher plants. This gene product proved 
to be pOHPP dioxygenase. 

To provide a molecular mechanism for manipulating 
synthesis and accumulation of the compounds 
plastoquinone, vitamin E, and carotenoids, the present 
inventors used a molecular genetic approach, taking 
advantage of the model plant system Arabidopsis 
thaliana to define, isolate and study genes required 
for synthesis of the compounds in plants. The 
flowering plant Arabidopsis thaliana has come into 
wide use as a model system to explore the molecular 
biology and genetics of plants. Arabidopsis offers 
many advantages for genetic analysis: it can be selfed 
and very large numbers of progeny can be obtained (up 
to 10,000 seeds from a single plant). Furthermore, 
Arabidopsis has a short generation time of five to six 
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weeks, so crosses can be set up and the progeny 
analyzed within reasonable periods of time. Mutation 
screens have identified thousands of mutations 
affecting many aspects of basic plant biology, 
including morphogenesis, photosynthesis, fertility, 
starch and lipid metabolism, mineral nutrition, an so 
on. In addition, its haploid genome is only about 10 s 
base pairs. 

An important aspect of the successful approach 
used here is that essential components were first 
functionally defined genetically, prior to their 
isolation, analysis and molecular manipulation. 
Briefly, potential mutants were identified by a 
combination of phenotypic and biochemical screening, 
characterized at the genetic and molecular levels, 
loci of interest selected, and the corresponding genes 
then cloned and studied further. By this approach, 
the inventors genetically defined and isolated cDNAs 
for one gene, pdsl, whose mutation disrupts synthesis 
of all three classes of compounds in the plastid, 
tocopherols/ plastoquinones and carotenoids. Based on 
biochemical analysis of the pdsl mutant, the pdsl gene 
was identified as affecting the activity of pOHPP 
dioxygenase, a crucial enzyme of the plastidic quinone 
pathway in plants (Figure 1) , that is directly 
required for the synthesis of plastoquinone and 
a-tocopherol and indirectly for carotenoid synthesis. 
In particular, the deduced function of the pdsl mutant 
and pOHPP dioxygenase enzyme are noted in Figure 2 . 

The present inventors demonstrated by biochemical 
complementation that the pdsl mutation affects the. 
enzyme p-hydroxyphenyl pyruvic acid dioxygenase (pOHPP 
dioxygenase) , because pdsl plants can be rescued by 
growth on the product but not the substrate of this 
enzyme , homogentisic acid (HGA) and 

p-hydroxyphenylpyruvate (pOHPP) , respectively. pOHPP 
dipxygenase is the key branch point enzyme and 
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committed step in the synthesis of boCTr Vitamin E and 
plastoquinones and several independent lines of 
biochemical evidence confirm pdsl affects this enzyme 
(Figures 1, 5, 6). These results provide the first 
5 genetic evidence that plastoquinones are essential 

components for carotenoid synthesis in higher plants, 
most likely as an electron carrier/redox element in 
the desaturation reaction (Figure 2) . The Arabidopsis 
pOHPP dioxygenase gene/cDNA thus provides a basis for 

10 modifying the production of plastoquinones, 

a-tocopherol and carotenoids in all higher plants. 

Specifically, the specification describes the 
genetic identification of the Arabidopsis pOHPP 
dioxygenase gene by mutational analysis, the' physical 

15 isolation and functional confirmation of an' 

Arabidopsis pOHPP dioxygenase cDNA, its nucleotide 
sequence and its use to isolate pOHPP dioxygenase 
genes and cDNAs from other plant species. Also 
included in the specification is a description of the 

20 use of the Arabidopsis pOHPP dioxygenase cDNA, and 
related cDNAs from other plants, to positively or 
negatively modify the expression/activity of pOHPP 
dioxygenase by recombinant techniques (overexpression, 
cosuppression, antisense, etc.) in any and all plant 

25 tissues, especially leaf and fruit tissues, to 

positively or negatively affect the production of 
a- toe, PQ and carotenoids. 

Elevating pOHPP dioxygenase protein levels 
increases the amount of homogentisic acid (HGA) 

30 synthesized in plant tissues. Because HGA is the 

limiting precursor molecule for a -toe and PQ synthesis 
(the end products of the pathway), increasing HGA 
synthesis increases the levels of or-toc (Vitamin E) 
and PQ in plant tissues. The increase in PQ 

3 5 indirectly increases the synthesis of carotenoids, 
which require PQ for their synthesis. In addition, 
the increase in PQ increases photosynthetic efficiency 
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by increasing electron flow between photosystem II and 
photosystem I, because PQ is the primary electron 
transporter between, the two photosystems . The 
increase in a- toe, a well-studied antioxidant in 
mammals, increases the ability of plants to withstand 
oxidative stresses, such as that caused by high light, 
high temperature, water stress, ozone stress, UV 
stress or other abiotic or biotic stresses. Elevating 
the levels of pOHPP dioxygenase will modify the dose 
response curve of herbicides targeting pOHPP 
dioxygenase, thus increasing the relative resistance 
to such herbicides in transgenic plants as compared to 
native plants of the same species. Inhibiting the 
expression of pOHPP dioxygenase is expected to have 
the opposite effect . 

Genetic Construct 

To express pOHPP dioxygenase in a plant, it is 
required that a DNA sequence containing the pOHPP 
dioxygenase coding sequence be combined with 
regulatory sequences capable of expressing the coding 
sequence in a plant. A number of effective plant 
promoters, both constitutive and developmentally or 
tissue specific, are known to those of skill in the 
art. A transcriptional termination sequence 
(polyadenylation sequence) may also be added. Plant 
expression vectors, or plasmids constructed for 
expression of inserted coding sequences in plants, are 
widely used in the art to assemble chimeric plant 
expression constructs including the coding sequence, 
and to conveniently transfer the constructs into 
plants. A sequence which codes for pOHPP dioxygenase 
includes, for example, SEQ ID NO : 1 , or versions of the 
designated sequence sufficient to effect coding for 
the expression of pOHPP dioxygenase. Commonly used 
methods of molecular biology well-known to those of 
skill in the art may be used to manipulate the DNA 
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sequences . 

By "genetic construct" we mean any of a variety 
of ways of combining the protein-encoding sequences 
with a promoter sequence (and termination sequence, if 
necessary) in a manner that operably connects the 
promoter sequence (and termination sequence, if 
present) with the protein-encoding sequences. 
Typically, the promoter sequence will be "upstream" of 
a protein-encoding sequence, while the termination 
sequence, if used, will be "downstream" of the 
protein-encoding sequences . 

The protein-encoding, promoter and termination 
sequences may be combined on a plasmid or viral 
vector, and inserted into a microbial host. Other 
functional sequences may be added to the gene 
construct. Alternatively, the protein-encoding, 
promoter, and termination sequence, if added, may be 
combined with any other needed functional sequences 
and used without a vector. 

The DNA sequence described by SEQ ID NO:l is 
sufficient to effect coding for the expression of 
pOHPP dioxygenase. However, it is envisioned that the 
above sequence could be truncated and still confer the 
same properties. It is not known at present which . 
specific deletions would be successful, but it is 
likely that some deletions to the protein would still 
result in effective enzymatic activity. One skilled 
in the art of molecular biology would be able to take 
the designated sequence and perform deletional 
analysis experiments to determine what portions of the 
designated sequence are essential to effect coding for 
the expression of pOHPP dioxygenase . One could create 
a genetic construct with the candidate deletion 
mutations and perform experiments as described below 
in the Examples, to test whether such deletion 
mutation sequences effect coding for the enzyme. 
Expression of the enzyme activity indicates a 
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successful deletion mutant or mutants . In this 
manner, one could determine which parts of the 
designated sequence is essential for expression of the 
enzyme . 

5 It is also known that the genetic code is 

degenerate, meaning that more than one codon, or set 
of three nucleotides, codes for each amino acid. Thus 
it is possible to alter the DNA coding sequence to a 
protein, such as the sequence for pOHPP dioxygenase 

10 described here, without altering the sequence of the 

protein produced. Selection of codon usage may affect 
expression level in a particular host. Such changes 
in codon usage are also contemplated here. 

It is further contemplated that using the 

15 Arabidopsis pOHPP gene coding sequence described here, 
that the homologous pOHPP dioxygenase sequences from 
other higher plants can be readily recovered. 
Oligonucleotides can be made from the sequence set 
forth below to either hybridize against cDNA or 

20 genomic libraries or used for PGR amplification of 
homologous pOHPP dioxygenase sequences from other 
plants. 

Once a pOHPP gene is in hand, whether from 
Arabidopsis or from some other plant species, it then 

25 becomes possible to insert a chimeric plant expression 
genetic construct into any plant species of interest. 
Suitable plant transformation methods exist to insert 
such genetic constructs into most, if not all, 
commercially important plant species. Presently known 

30 methods include Agrobacterium-mediated transformation, 
coated-particle gene delivery (Biolistics) and 
electroporation, in which an electric voltage is used 
to facilitate gene insertion. All these methods, and 
others, can insert the genetic construct into the 

3 5 genome of the resulting transgenic plant in such a way 
that the genetic construct becomes an inheritable 
trait, transmitted to progeny of the original 
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transgenic plant by the normal rules diTMendelian 
inheritance. Thus, once a genetic construct 
expressing a pOHPP gene is inserted into a plant, it 
can become a part of a plant breeding program for 
5 transfer into any desired genetic background. 

To over-express pOHPP dioxygenase, a genetic 
construct may be used with a higher strength promoter. 
To inhibit expression of endogenous pOHPP dioxygenase, 
an antisense genetic construct can be made, as is 
10 known by those of skill in the art, to reduce the 
level of pOHPP dioxygenase present in the plant 
tissues . 

EXAMPLES 

Isolation of pdsl, a mutant defective in carotenoid 
15 synthesis 

To further understand carotenoid biosynthesis and 
its integration with other pathways in the chloroplast 
in higher plants, the present inventors studied the 
pathway by isolating Arabidopsis thaliana mutants 

20 that are blocked in carotenoid synthesis. 

Plants homozygous for defects in the early stages 
of carotenoid synthesis (e.g. prior to production of 
/3-carotene) are lethal when grown in soil and the 
isolation of such mutations requires the design of 

25 screening procedures to identify plants heterozygous 
for soil lethal mutations. The present inventors 
found that most soil lethal, homozygous 
pigment -deficient Arabidopsis mutants can be grown to 
near maturity in tissue culture on Murashige and Skoog 

3 0 basal media (Murashige and Skoog, Physiol . Plant. 
15:473-497, 1962) supplemented with sucrose (MS2 
media) . Under these conditions, photosynthesis and 
chloroplast development are essentially dispensable 
and all the energy and nutritional needs of the plant 

35 are supplied by the media. 

Greater than 500 lines from the 10,000 member 
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Feldmann T-DNA tagged Arabidopsis thaliana population 
(Forsthoefel et al . , Aust . J. Plant Physiol. 19:353- 
366, 1992) were selected for pigment analysis based on 
their segregation for lethal pigment mutations. Seed 
from plants heterozygous for lethal pigment mutations 
were surface sterilized, grown on MS2 media, the 
segregating pigment mutants identified, tissue 
harvested from individual plants, and HPLC pigment 
analysis performed. Although numerous mutant lines 
with severe pigment deficiencies were identified, only 
two were found to be carotenoid biosynthetic mutants. 
One mutant line isolated from this group, pdsl, is 
described in detail here. 

The hallmark phenotype for disruption of a 
biosynthetic pathway is the accumulation of an 
intermediate compound prior to the site of blockage. 
Such blockage of the carotenoid pathway can be 
mimicked chemically by treatment of wild-type plants 
with the herbicide NFZ, an inhibitor of the phytoene 
desaturase enzyme (Figure 1) which has been reported 
to cause accumulation of phytoene in treated tissues. 
Britton, Z. Naturforsch 34c:979-985, 1979. Figs. 3A- 
3E present the results of pigment analysis of wild- 
type, NFZ-wt, and pdsl tissues. Abbreviations in 
Figs. 3A-3E are as follows: N, neoxanthin; V, 
violaxanthin; L, lutein; Cb, chlorophyll b; Ca, 
chlorophyll a; (3 , /^-carotene. 

Figure 3A shows C 18 Reverse Phase HPLC analysis of 
the carotenoids that accumulate in wild- type 
Arabidopsis thaliana leaves. In comparison. Figure 3B 
shows the pigment profile for NFZ treated wild-type 
(NFZ-Wt) . Spectral analysis of the strongly absorbing 
296nm peak at 33 minutes in NFZ-Wt tissue shows 
absorbance maxima at 276, 286, and 298nm, indicative 
of phytoene (Figure 3D) . Figure 3C shows pigment 
analysis of tissue culture grown homozygous pdsl 
mutant plants. The low absorbance at 440nm in Figures 
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3B and C demonstrates that like NFZ-Wt, pdsl mutants 
lack all chlorophylls and carotenoids that normally 
accumulate in wild-type tissue (compare to Figure 3A) . 
However, unlike wild-type, pdsl mutants contain a peak 
with a retention time at approximately 33 minutes that 
absorbs strongly at 296nm. The retention time and 
absorbance of the 33 -minute peak in the pdsl mutant 
corresponds to the phytoene peak in pigment extracts 
of NFZ-Wt tissue (Figure 3B) . Spectral analysis of 
the 33 -minute peak from pdsl is shown in Figure 3E and 
is virtually identical to the spectra of phytoene from 
NFZ-Wt tissue (Figure 3D) as well as to the published 
spectra for phytoene. These results confirm the 
chemical identity of the accumulating compound in pdsl 
as phytoene and conclusively demonstrate that the pdsl 
mutation disrupts carotenoid biosynthesis. 

Carotenoid Analysis 

For quantitative and qualitative carotenoid 
analysis, plant tissue is placed in a microfuge tube, 
and ground with a micropestle in 200/il of 80% acetone. 
120ptl of ethyl acetate is added and the mixture 
vortexed. 140^1 of water is added and the mixture 
centrifuged for. 5 minutes. The carotenoid containing 
upper phase is then transferred to a fresh tube and 
vacuum dried in a Jouan RC1010 Centrifugal Evaporator. 
The dried extract is resuspended in ethyl acetate at a 
concentration of 0 . 5mg fresh weight of tissue per /il 
and either analyzed immediately by HPLC or stored at 
-80 *C under nitrogen. 

Carotenoids were separated by reverse -phase HPLC 
analysis on a Spherisorb ODS2 5 micron C 18 column, 25 
cm in length (Phase Separations Limited, Norwalk, CT) 
using a 45 minute gradient of Ethyl Acetate (0-100%) 
in Acetonitrile/water/triethylamine (9:1:0.01 v/v) , at 
a flow rate of 1 ml per minute (Goodwin and Britton, 
1988) . Carotenoids were identified by retention time 
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relative to known standards with detection at both 
296nm and 440nm. When needed, absorption spectra for 
individual peaks were obtained with a Hewlett Packard 
1040A photodiode array detector and compared with 
published spectra or available standards. 

Quinone analysis 

Quinones were extracted from tissue using a 
method modified from that described in Bligh et al . , 
Can. J, Biochem. Phvsiol. 37:911-917, 1959. Frozen 
plant tissue was ground in a mortar with 3 volumes of 
chloroform and 6 volumes of methanol and transferred 
to a test tube. Water and additional chloroform were 
added until a biphasic mixture was obtained. The 
quinone containing chloroform phase was then 
collected. To increase yields, the aqueous phase was 
back-extracted with chloroform, the two chloroform 
phases pooled, and then filtered through Whatman #3 
filter paper. The resulting filtrate was dried under 
a constant stream of nitrogen. Once dried, the pellet 
was resuspended in methanol at a concentration of lOmg 
fresh weight per ml and immediately analyzed by HPLC. 
Quinones were resolved by reversed-phase HPLC analysis 
on a LiChrosorb RP-8, 5 micron column, 25cm in length, 
(Alltech, San Jose, CA) using an isocratic solvent of 
10% H 2 0 in Methanol for the first 14 minutes, at which 
time the solvent was switched to 100% methanol for the 
remainder of the run (modified from the method 
described in Lichtenthaler , Handbook of 
Chromatography , CRC Press, 115-159, 1984). The flow 
rate was 1ml per minute for the duration. Peaks were 
identified . based upon the retention time of known 
standards with detection at 280nm for a- tocopherol and 
26 0nm for plastoquinone and ubiquinone as well as by 
absorption spectra from a Hewlett Packard 1040A 
photodiode array detector. When needed, fractions 
represented by individual chromatographic peaks were 



WO 97/27285 



PCT/US97/01384 



10 




15 



20 



25 



30 



BNSDOCtD: <WO_972728SA1 JU> 



collected, and submitted to the Southwest 
Environmental Health Science Center, Analytical Core 
laboratory for mass spectral analysis. Results were 
obtained using a TSQ7000 tandem mass spectrometer 
(Finnigan Corp., San Jose, CA) equipped with an 
atmospheric pressure chemical ionization source 
operated in the positive ion mode. The instrument was 
set to unit resolution and the samples were introduced 
into the source in a 0.3 ml/minute methanol stream and 
ionized using a 5kV discharge. 

Genetic analysis of pdsl 

The genetic nature of the pdsl mutation was 
determined by analyzing seeds resulting from selfing 
pdsl heterozygous plants. Prior to desiccation, Fl 
seeds were scored as either green (wild- type or 
heterozygous) or white (homozygous). A 3:1 
segregation ratio was observed (146 green seeds: 48- 
white seeds) , indicating that pdsl is inherited as 
single recessive nuclear mutations (X 2 =0.01, p > 0.90). 
Because pdsl mutants are inhibited in the desaturation 
of phytoene, the inventors believed that it might be a 
mutation in the phytoene desaturase enzyme, which had 
previously been mapped to chromosome 4, between ag ^and 
bp. Wetzel et al . , Plant J. 6:161-175, 1994. To test 
this hypothesis, the pdsl mutation was mapped relative 
to visible markers. The pdsl mutation was found to 
map to chromosome 1, approximately 7 cM from disl 
toward clv2. Franzmann et al., Plant J. 7:341-350, 
1995. These data points are summarized in Figure 4 
and establish that pdsl does not map to the phytoene 
desaturase enzyme locus, thus proving that the pdsl 
mutation is not in the phytoene desaturase enzyme. 
This data provided important insight for 
characterization of the pdsl mutant. 
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Homozygous pdsl mutants can be rescued by Homogentisic 
Acid, an intermediate in plas toguinone and tocopherol 
biosynthesis 

As described earlier, previous research 
suggesting a role for quinones and pOHPP dioxygenase 
in phytoene desaturation lead the present inventors to 
investigate the quinone biosynthetic pathway in the 
pdsl mutant . The early stages of 

plas toquinone/ tocopherol synthesis were functionally 
analyzed by growth in the presence of two intermediate 
compounds in the pathway, p-hydroxyphenylpyruvate 
(pOHPP) and homogentisic acid (HGA) (refer to Figures 
1 and 2). Albino pdsl homozygous plants were first 
germinated on MS2 media and then transferred to MS 2 
media supplemented with 100/^M of either pOHPP or HGA. 
pdsl plants remained albino when transferred to media 
containing pOHPP but greening occurred when pdsl 
plants were transferred to media containing HGA. 
Figs. 5A-5C present the results of complementation of 
the pdsl mutation with homogentisic acid. Each 
profile represents pigments extracted from lOmg fresh 
weight of tissue. Abbreviations used in Figs. 5A-5C 
are as described in Figs. 3A-3E. HPLC analysis with 
detection at 440nm of the carotenoids extracted from 
pdsl plants grown on pOHPP and HGA are shown in 
Figures 5B and C, respectively. The pigment profiles 
of pdsl mutants grown on pOHPP are similar to the 
profiles of pdsl plants grown on MS 2 media shown in 
Figure 3B. Comparison of the pigment profiles for 
pdsl + HGA tissue and wild- type tissue (Figures 5A and 
5C) indicates that growth in the presence of HGA is 
able to qualitatively restore a wild-type carotenoid 
profile to albino, homozygous pdsl plants. These 
results indicate that the pdsl mutation affects the 
enzyme pOHPP dioxygenase, because pdsl mutants are not 
altered by growth on the substrate of this enzyme, 
pOHPP, but rather, are restored qualitatively to 
wild- type pigmentation by growth on the product of 
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this enzyme, HGA (refer to Figures 1 and 2). The 
complementation of pdsl with HGA also indicates that 
intermediates or end products of this pathway 
(plastoquinone and/or tocopherols, refer to Figures 1 
and 2) are necessary components for phytoene 
desaturation in plants and confirms the observation of 
Schultz et al . in FEBS where inhibitors of pOHPP 
dioxygenase were shown to cause accumulation of 
phytoene . 

HPLiC analysis conclusively demonstrates that pdsl is a 
mutation in the plastoquinone/ tocopherol biosynthetic 
pathway that also affects carotenoid synthesis 

In addition to biochemical complementation of 
pdsl mutants, the plastoquinone/tocopherol pathway was 
also directly analyzed in pdsl tissue by utilizing C a 
HPLC to resolve total lipid extracts and identify 
three separate classes of quinones: ubiquinone, 
plastoquinone, and a- tocopherol (Vitamin E) (Figures 5 
and 6) . Ubiquinone and plastoquinone perform 
analogous electron transport functions in the 
mitochondria and chloroplast, respectively, but are 
synthesized by different pathways in separate 
subcellular compartments *< Goodwin et al . , Introduction 
to Plant Biochemistry , Oxford, Pergamon Press, 1983)*, 
making ubiquinone an ideal internal control in these 
analyses. Figure 6 shows the C 8 HPLC analysis of lipid 
soluble extracts from NFZ-Wt tissue and pdsl tissue. 
In NFZ-Wt tissue (Figure 6A) , peaks 3 and 4 were 
identified as ubiquinone and plastoquinone, 
respectively, based on retention time (26 and 27 
minutes) , optical spectra, and mass spectra (results 
not shown) . NFZ-Wt tissue contained a peak (1) with a 
retention time of 13.5 minutes which was identified as 
a- tocopherol based upon the retention time of a 
standard. However, optical spectroscopy and mass 
spectrometry demonstrated that peak 1 was composed of 
two major components: a- tocopherol (la) and an 
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unidentified compound (lb). The mass of ox- tocopherol 
was determined to be 430 as indicated by the presence 
of the 431 protonated molecule while the molecular 
mass of the unidentified compound was 412, as 
indicated by the presence of the 413 protonated 
molecule (data not shown) , clearly demonstrating the 
presence of two compound in peak 1. This quinone 
analysis demonstrates that the herbicide NFZ, which 
specifically inhibits the phytoene desaturase enzyme, 
does not affect synthesis of homogent isate derived 
quinones. pdsl tissue (Figure 6B) contain ubiquinone 

(peak 3) but lack plastoquinone (peak 4) . 
Additionally, though pdsl contains a peak at 13.5 
minutes, optical spectroscopy and mass spectrometry 
data demonstrate that this peak lacks oe- tocopherol 

(la) and is composed solely of the compound lb (data 
not shown) . Therefore, homozygous pdsl plants 
accumulate ubiquinone but lack both plastoquinone and 
a- tocopherol . This is consistent with the pdsl 
mutation affecting pOHPP dioxygenase (refer to Figures 
1 and 2) , as suggested by the rescue of the mutation 
by HGA, and provide additional evidence that the pdsl 
mutation disrupts pOHPP dioxygenase. 

Isolation of a truncated, putative pOHPP dioxygenase 
AraJbidopsin cDNA 

The observation of Schultz et al . demonstrating 
that inhibitors of pOHPP dioxygenase activity disrupt 
carotenoid synthesis and cause accumulation of 
phytoene provided important insight for the 
characterization of the pdsl mutant which in turn 
provided the present inventors with important insight 
for the isolation of a putative cDNA for the pdsl 
locus. In animals, genetic defects which inhibit the 
activity of pOHPP dioxygenase lead to tyrosinemia type 
I, a fatal inherited disease in aromatic amino acid 
catabolism characterized by the presence of high 
levels of pOHPP in the urine. 
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In an effort to further understand the nature of 
this disease, pOHPP dioxygenase cDNAs have been cloned 



(summarized in Ruetschi et al., Eur. J. Biochem. 
205:459-466, 1992). Amino acid identity between 



in comparison, their identity to bacterial homologs is 
very low, less than 28%. By using mammalian and 
bacterial sequences to search the Expressed Sequence 
Tags (ESTs) computer DNA database (Newman et al., 
Plant Physiol . 106:1241-1255, 1994), one partial 
length Arabidopsis EST was identified and used as a 
probe. The partial length Arabidopsis probe 
corresponds to base pairs 1072 through 1500 of SEQ ID 
NO : 1 . 

This cDNA contained only 9 9 amino acids of the 
carboxyl terminal portion of the protein coding 
region. The deduced protein sequence of this putative 
Arabidopsis pOHPP dioxygenase cDNA shows similar 
homology (-50% identity) to both the mammalian and 
bacterial pOHPP dioxygenases . Interestingly, the 
partial Arabidopsis sequence also contains a 15 amino 
acid insertion not found in the human or bacterial 
enzymes. Finally, alignment of six pOHPP dioxygenase 
sequences from mammals and bacteria identified three 
regions of high conservation, the highest being a 16 
amino acid region near the carboxy end of pOHPP 
dioxygenases that shows 62.5% identity across all 
phyla. Ruetschi et al., Eur. J. Biochem. 205:459-466, 
1992. This region is also present in the truncated 
Arabidopsis sequence. The lines of evidence suggest 
that the partial length Arabidopsis cDNA described 
above encodes a pOHPP dioxygenase, most likely the 
pdsl locus . 



from several mammalian and bacterial sources 



various mammalian pOHPP dioxygenase enzymes is >80%; 
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Isolation and Characterization of a full length 
AraJbid ops ispOHPP di oxygenase cDNA 

Utilizing the partial length Arabidopsis cDNA 
probe, an Arabidopsis cDNA library was screened by 
5 nucleic acid hybridization for full length cDNAs . A 
large number of hybridizing cDNAs were isolated, and 
one of the longest, pHPP1.5, containing a 1,520 bp 
insertion, was sequenced completely; the insert is 
presented as SEQ ID NO: 1. pHPPl . 5 encodes a 446 

10 amino acid protein (presented as SEQ ID NO : 2 ) , which 
is slightly larger in size than mammalian and 
bacterial pOHPP dioxygenases. pHPPl . 5 shows 34-40% 
identity at the amino acid level to pOHPP dioxygenases 
from various mammals and bacteria. In comparing four 

15 bacterial pOHPP dioxygenases and one mammalian pOHPP 
dioxygenases (pig) which ranged in size from 346-404 
amino acids, Denoya et al . identified 69 amino acids 
that were conserved between all five pOHPP 
dioxygenases. Denoya et al . , J. Bacteriol . 176:5312- 

20 5319, 1994. The pHPPl . 5 coding region contains 52 of 
these 69 conserved amino acids. 

Demonstration that pHPPl.5 encodes an active pOHPP 
dioxygenase protein and complements the pdsl mutation 

In order to definitively demonstrate that pHPP1.5 
25 is the gene product encoded by the pdsl locus and that 
it encodes a functional pOHPP dioxygenase protein, the 
pHPPl . 5 cDNA was cloned into a plant transformation 
vector for molecular complementation experiments with 
the pdsl. The full length wild-type pOHPP dioxygenase 
30 cDNA will be subcloned into a plant transformation 
vector driven by the Cauliflower Mosaic Virus 3 5S 
(CaMV) promoter and containing all necessary 
termination cassettes and selectable markers (Kan r ) . 
The CaMV promoter is a strong constitutive promoter. 
35 This single construct and the vector without the 

pHPPl . 5 insert (as a control) will be used in vacuum 
infiltration transformation which uses whole soil 
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grown plants and will be done on plants that are 
heterozygous for the pdsl mutation. Bouchez et al . , CR 
Acad. Sci. Paris, Sciences de la vie 316, 1993. 

In the standard procedure, 20-30 soil grown 
plants will be independently transformed and analyzed 
separately. In this case homozygous plants containing 
the pdsl mutation would be lethal while heterozygous 
plants containing the pdsl mutation would be 
segregating 2:1 for the pdsl mutation in their 
siliques. The inventors will use a similar number of 
wild type plants in a parallel transformation as a 
control. After transformation of the pdsl segregating 
plant population, as the plants are setting seed the 
inventors can easily identify those heterozygous for 
the pdsl mutation in retrospect by inspection of their 
siliques which would contain green: white embryos in a 
3 : 1 ratio. 

Seed harvested from individually transformed 
heterozygous pdsl plants will be germinated on 
kanamycin and resistant seedlings transferred to soil. 
Segregation analysis of seed from these primary 
transf ormants (T2 seed) and T3 seed for segregation of 
the pdsl phenotype (albino and phytoene accumulating)- 
and the T-DNA encoded kanamycin resistance marker 
(wild type pOHPP dioxygenase cDNA) will conclusively 
demonstrate complementation of the pdsl mutation with 
the pOHPP dioxygenase cDNA. To provide additional 
proof that pHPPl . 5 is encoded by the pdsl locus, the 
pHPPl . 5 cDNA has been mapped relative to the pdsl 
locus using recombinant inbred lines, as described in 
Lister et al . , Plant J. 4:745-750, 1993. The pHPPl . 5 
cDNA mapped to the region of chromosome 1 containing 
the pdsl mutation (Figure 4). Finally, the pHPPl . 5 
cDNA will be overexpressed in E. coli and the activity 
of the protein determined. 
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Modification of pOHPP Expression 

From the genetic and biochemical studies 
described above it is clear that only one pOHPP 
dioxygenase gene product is involved in chloroplastic 
5 quinone synthesis, that the pdsl mutation defines this 
gene, that the pHPPl . 5 cDNA is the product encoded by 
the pdsl locus and that disruption of its function 
completely eliminates Vitamin E production and 
plastoquinone and carotenoid synthesis in plant 

10 tissues. Modification of pOHPP dioxygenase expression 
in plants by molecular techniques using pHPPl . 5 can 
therefore be used to positively or negatively affect 
the production of tocopherols, plastoquinones directly 
and carotenoids indirectly (refer to Figures 1 and 2) . 

15 Specifically, overexpression of the pOHPP dioxygenase 
enzyme will result in increased levels of one or more 
of these compounds in the tissues of transgenic 
plants. Alternatively, using antisense techniques, it 
is possible to lower the level of enzyme activity to 

20 decrease the levels of these compounds in plants. 

Additionally, overexpression of the pOHPP dioxygenase 
will enable a transgenic plant to withstand elevated 
levels of herbicides that target this enzyme, 
providing agrinomically significant herbicide 

25 resistance relative to normal plants. 

Two different plant systems, Arabidopsis and 
tomato, are being used to demonstrate the effects of 
modified pOHPP dioxygenase in plant tissues. 
Constitutive overexpression of pOHPP dioxygenase will 

3 0 be done in both plant systems utilizing the CaMV 3 5S 
promoter and the pHPPl . 5 cDNA. The consequences of 
this altered expression on tocopherol, plastoquinone 
and carotenoid levels and profiles in various plant 
tissues will be determined as described below. In 

35 tomato, tissue specific overexpression, of pOHPP 

dioxygenase (pHPPl.5) will be driven by the fruit 
specific promoter derived from the tomato /3subunit 
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gene, which is expressed specifically in developing, 
but not ripening tomato fruit. This will determine 
the potential for modifying the levels of tocopherol, 
plastoquinone and carotenoids specifically in 
developing and ripening fruit for nutritional purposes 
without affecting their production in other plant 
tissues. These combined experiments will determine 
whether pOHPP dioxygenase is a rate limiting step in 
chloroplastic homogentisic acid derived quinone 
synthesis and the potential for manipulating 
chloroplastic homogentisic acid derived quinones 
(tocopherols and plastoquinones ) and compounds that 
^require quinones for their synthesis (carotenoids, 
etc) by increasing pOHPP dioxygenase activity. 

Multiple independent transf ormants will be 
produced for each construct and plant species used. 
The integration and gene copy number of each chimeric 
gene in each line will be confirmed by southern 
analysis, the level of pOHPP mRNA determined by 
Northern blot analysis, pOHPP dioxygenase activity 
determined as described in Schulz et al . , FEBS 
318:162-166, 1993, and the effects on individual 
chloroplastic components of interest analyzed 
(tocopherols, plastoquinones and carotenoids). In - 
green tissue containing constitut ively expressing 
constructs this analysis can occur relatively soon 
after transf ormants are put into soil. Analysis of 
fruit specific construct lines will require much more 
time for fruit set to occur. Analysis of tocopherols, 
plastoquinones and carotenoids will be by a 
combination of HPLC, optical and mass spectra as 
described in Norris et al . (1995, in press) . Analysis 
of tocopherol levels is performed by HPLC and when 
needed by GCrmass spectroscopy in selected ion mode. 
In MS analysis the absolute level of tocopherol will 
be quantified by isotopic dilution with a known, 
"heavy carbon" tocopherol standard added at the start 
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of the extraction. Determination based on fresh 
weight of tissue can also be performed. Plastoquinone 
levels will be quantified by C8 HPLC and optical 
spectra as described in Norris et al . (1995, in 
5 press) . Total carotenoid levels are determined 

spectrophotometrically and the levels of individual 
carotenes quantified by C18 HPLC and optical spectra 
quantified to standards. In the course of these 
experiments we will identify high expressing lines 
10 with simple insertions that segregate as single 
genetic loci in progeny. This will facilitate 
analysis of the inheritance of the gene and phenotype 
in future generations. 

Overexpression of pQHPP for in vitro Herbicide 
15 Analysis 

pOHPP dioxygenase will be overexpressed in E . 
coli or other prokaryotic or eukaryotic protein 
production systems and purified in large amounts for 
use in enzymatic assays for identifying new herbicide 
20 compounds (pOHPP inhibitors) and optimizing existing 
chemistries through detailed kinetic analysis. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: DellaPenna, Dean 

Norris, Susan 

5 (ii) TITLE OF INVENTION: Cloned Plant P-Hydroxyphenyl 

Pyruvic Acid Dioxygenase 

(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

10 (A) ADDRESSEE: Quarles & Brady 

(B) STREET: PO Box 2113 

(C) CITY: Madison 

(D) STATE: WI 

(E) COUNTRY: USA 
15 ( F) ZIP: 53701-2113 

<v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

2 0 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

2 5 (viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Seay, Nicholas J 

(B) REGISTRATION NUMBER: 27,386 

(C) REFERENCE /DOCKET NUMBER: 920214.90158 

(ix) TELECOMMUNICATION INFORMATION: 

3 0 (A) TELEPHONE: 608-251-5000 

(B) TELEFAX: 608-251-9166 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1519 base pairs 

3 5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

4 0 (A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE : 

(B) CLONE: pHPPl . 5 

(ix) FEATURE: 

(A) NAME / KEY : CDS 
45 (B) LOCATION: 37.. 1374 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CCACGCGTCC G AGTTTT AG C AGAGTTGGTG AAATCA ATG GGC CAC CAA AAC GCC 54 

Met Gly His Gin Asn Ala 
1 5 

5 GCC GTT TCA GAG AAT CAA AAC CAT GAT GAC GGC GCT GCG TCG TCG CCG 102 

Ala Val Ser Glu Asn Gin Asn His Asp Asp Gly Ala Ala Ser Ser Pro 
10 15 20 

GGA TTC AAG CTC GTC GGA TTT TCC AAG TTC GTA AG A AAG AAT CCA AAG 150 
Gly Phe Lys Leu Val Gly Phe Ser Lys Phe Val Arg Lys Asn Pro Lys 
10 . 25 30 35 

TCT GAT AAA TTC AAG GTT AAG CGC TTC CAT CAC ATC GAG TTC TGG TGC 198 
Ser Asp Lys Phe Lys Val Lys Arg Phe His His lie Glu Phe Trp Cys 
40 45 50 

GGC GAC GCA ACC AAC GTC GCT CGT CGC TTC TCC TGG GGT CTG GGG ATG 24 6 
15 Gly Asp Ala Thr Asn Val Ala Arg Arg Phe Ser Trp Gly Leu Gly Met 

55 60 65 70 

AGA TTC TCC GCC AAA TCC GAT CTT TCC ACC GGA AAC ATG GTT CAC GCC 2 94 
Arg Phe Ser Ala Lys Ser Asp Leu Ser Thr Gly Asn Met Val His Ala 
75 80 85 

20 TCT TAC CTA CTC ACC TCC GGT GAC CTC CGA TTC CTT TTC ACT GCT CCT 3 42 

Ser Tyr Leu Leu Thr Ser Gly Asp Leu Arg Phe Leu Phe Thr Ala Pro 
90 95 100 

TAC TCT CCG TCT CTC TCC GCC GGA GAG ATT AAA CCG ACA ACC ACA GCT 3 90 
Tyr Ser Pro Ser Leu Ser Ala Gly Glu lie Lys Pro Thr Thr Thr Ala 
25 105 110 115 

TCT ATC CCA AGT TTC GAT CAC GGC TCT TGT CGT TCC TTC TTC TCT TCA 438 
Ser lie Pro Ser Phe Asp His Gly Ser Cys Arg Ser Phe Phe Ser Ser 
120 125 130 

CAT GGT CTC GGT GTT AGA GCC GTT GCG ATT GAA GTA GAA GAC GCA GAG 4 86 
3 0 His Gly Leu Gly Val Arg Ala Val Ala lie Glu Val Glu Asp Ala Glu 

135 140 145 150 

TCA GCT TTC TCC ATC AGT GTA GCT AAT GGC GCT ATT CCT TCG TCG CCT 534 
Ser Ala Phe Ser lie Ser Val Ala Asn Gly Ala lie Pro Ser Ser Pro 
155 160 165 

3 5 CCT ATC GTC CTC AAT GAA GCA GTT ACG ATC GCT GAG GTT AAA CTA TAC 58 2 

Pro lie Val Leu Asn Glu Ala Val Thr lie Ala Glu Val Lys Leu Tyr 
170 175 180 

GGC GAT GTT GTT CTC CGA TAT GTT AGT TAC AAA GCA GAA GAT ACC GAA 6 30 
Gly Asp Val Val Leu Arg Tyr Val Ser Tyr Lys Ala Glu Asp Thr Glu 
40 185 190 195 

AAA TCC GAA TTC TTG CCA GGG TTC GAG CGT GTA GAG GAT GCG TCG TCG 6 78 
Lys Ser Glu Phe Leu Pro Gly Phe Glu Arg Val Glu Asp Ala Ser Ser 
200, 205 210 

TTC CCA TTG GAT TAT GGT ATC CGG CGG CTT GAC CAC GCC GTG GGA AAC 726 

4 5 Phe Pro Leu Asp Tyr Gly lie Arg Arg Leu Asp His Ala Val Gly Asn 

215 220 225 . 230 

GTT CCT GAG CTT GGT CCG GCT TTA ACT TAT GTA GCG GGG TTC ACT GGT 7 74 
Val Pro Glu Leu Gly Pro Ala Leu Thr Tyr Val Ala Gly Phe Thr Gly 
235 240 245 
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TTT CAC CAA TTC GCA GAG TTC ACA GCA GAC GAC GTT GGA ACC GCC GAG 82 2 
Phe His Gin Phe Ala Glu Phe Thr Ala Asp Asp Val Gly Thr Ala Glu 
250 255 260 

AGC GGT TTA AAT TCA GCG GTC CTG GCT AGC AAT GAT GAA ATG GTT CTT 8 70 
5 Ser Gly Leu Asn Ser Ala Val Leu Ala Ser Asn Asp Glu Met Val Leu 

265 270 275 

CTA CCG ATT AAC GAG CCA GTG CAC GGA ACA AAG AGG AAG AGT CAG ATT 918 
Leu Pro lie Asn Glu Pro Val His Gly Thr Lys Arg Lys Ser Gin lie 
280 285 290 

10 CAG ACG TAT TTG GAA CAT AAC GAA GGC GCA GGG CTA CAA CAT CTG GCT 96 6 

Gin Thr Tyr Leu Glu His Asn Glu Gly Ala Gly Leu Gin His Leu Ala 
295 300 305 310 

CTG ATG AGT GAA GAC ATA TTC AGG ACC CTG AGA GAG ATG AGG AAG AGG 1014 
Leu Met Ser Glu Asp He Phe Arg Thr Leu Arg Glu Met Arg Lys Arg 
15 315 320 325 

AGC AGT ATT GGA GGA TTC GAC TTC ATG CCT TCT CCT CCG CCT ACT TAC 106 2 
Ser Ser He Gly Gly Phe Asp Phe Met Pro Ser Pro Pro Pro Thr Tyr 
330 335 340 

TAC CAG AAT CTC AAG AAA CGG GTC GGC GAC GTG CTC AGC GAT GAT CAG 1110 
2 0 Tyr Gin Asn Leu Lys Lys Arg Val Gly Asp Val Leu Ser Asp Asp Gin 

345 350 355 

ATC AAG GAG TGT GAG GAA TTA GGG ATT CTT GTA GAC AGA GAT GAT CAA 115 8 
He Lys Glu Cys Glu Glu Leu Gly He Leu Val Asp Arg Asp Asp Gin 
360 365 370 

2 5 GGG ACG TTG CTT CAA ATC TTC ACA AAA CCA CTA GGT GAC AGG CCG ACG 1206 

Gly Thr Leu Leu Gin He Phe Thr Lys Pro Leu Gly Asp Arg Pro Thr 
375 380 385 390 

ATA TTT ATA GAG ATA ATC CAG AGA GTA GGA TGC ATG ATG AAA GAT GAG 12 54 
He Phe He Glu He He Gin Arg Val Gly Cys Met Met Lys Asp Glu 
30 395 400 405 

GAA GGG AAG GCT TAC CAG AGT GG^ GGA TGT GGT GGT TTT GGC AAA GGC 13 02 
Glu Gly Lys Ala Tyr Gin Ser Gly Gly Cys Gly Gly Phe Gly Lys Gly 
410 415 420 

AAT TTC TCT GAG CTC TTC AAG TCC ATT GAA GAA TAC GAA AAG ACT CTT 13 50 

3 5 Asn Phe Ser Glu Leu Phe Lys Ser He Glu Glu Tyr Glu Lys Thr Leu 

425 430 435 

GAA GCC AAA CAG TTA GTG GGA TGA ACAAGAAGAA G AAC C AACTA AAGGATTGTG 14 04 
Glu Ala Lys Gin Leu Val Gly * 
440 445 

4 0 TAATTAATGT AAAACTG TTT TATCTTATCA AAACAATGTT ATACAACATC TCATTTAAAA 14 64 

ACGAGATCAA TCAAAAAATA CAATCTTAAA TTCAAAACCA AAAAAAAAAA AAAAA 1519 

( 2 J INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 amino acids 
45 (B) TYPE: amino acid 

.(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Gly His Gin Asn Ala Ala Val Ser Glu Asn Gin Asn His Asp Asp 
1 5 10 15 

Gly Ala Ala Ser Ser Pro Gly Phe Lys Leu Val Gly Phe Ser Lys Phe 
5 20 25 30 

Val Arg Lys Asn Pro Lys Ser Asp Lys Phe Lys Val Lys Arg Phe His 
35 40 45 

His lie Glu Phe Trp Cys Gly Asp Ala Thr Asn Val Ala Arg Arg Phe 
50 55 60 

10 Ser Trp Gly Leu Gly Met Arg Phe Ser Ala Lys Ser Asp Leu Ser Thr 

€5 70 75 80 

Gly Asn Met Val His Ala Ser Tyr Leu Leu Thr Ser Gly Asp Leu Arg 
85 90 95 

Phe Leu Phe Thr Ala Pro Tyr Ser Pro Ser Leu Ser Ala Gly Glu lie 
15 100 105 110 

Lys Pro Thr Thr Thr Ala Ser lie Pro Ser Phe Asp His Gly Ser Cys 
115 120 125 

Arg Ser Phe Phe Ser Ser His Gly Leu Gly Val Arg Ala Val Ala lie 
130 135 140 

20 Glu Val Glu Asp Ala Glu Ser Ala Phe Ser lie Ser Val Ala Asn Gly 

145 150 155 160 

Ala lie Pro Ser Ser Pro Pro He Val Leu Asn Glu Ala Val Thr He 
165 170 175 

Ala Glu Val Lys Leu Tyr Gly Asp Val Val Leu Arg Tyr Val Ser Tyr 
25 180 185 190 

Lys Ala Glu Asp Thr Glu Lys. Ser Glu Phe Leu Pro Gly Phe Glu Arg 
195 200 e 205 

Val Glu Asp Ala Ser Ser Phe Pro Leu Asp Tyr Gly He Arg Arg Leu 
210 215 220 

3 0 Asp His Ala Val Gly Asn Val Pro Glu Leu Gly Pro Ala Leu Thr Tyr 

225 230 235 240 

Val Ala Gly Phe Thr Gly Phe His Gin Phe Ala Glu Phe Thr Ala Asp 
245 250 255 

Asp Val Gly Thr Ala Glu Ser Gly Leu Asn Ser Ala Val Leu Ala Ser 
35 260 265 270 

Asn Asp Glu Met Val Leu Leu Pro tie Asn Glu Pro Val His Gly Thr 
275 280 285 

Lys Arg Lys Ser. Gin He Gin Thr Tyr Leu Glu His Asn Glu Gly Ala 
290 295 300 

4 0 Gly Leu Gin His Leu Ala Leu Met Ser Glu Asp He Phe Arg Thr Leu 

305 310 315 . 320 

Arg Glu Met Arg Lys Arg Ser Ser He Gly Gly Phe Asp Phe Met Pro 
325 330 335 

Ser Pro Pro Pro Thr Tyr Tyr Gin Asn Leu Lys Lys Arg Val Gly Asp 
45 340 345 350 
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Val Leu Ser Asp Asp Gin lie Lys Glu Cys Glu Glu Leu Gly lie Leu 

355 360 365* 

Val Asp Arg Asp Asp Gin Gly Thr Leu Leu Gin lie Phe Thr Lys Pro 
370 375 380 

5 Leu Gly Asp Arg Pro Thr lie Phe lie Glu lie lie Gin Arg Val Gly 

385 390 395 400 

Cys Met Met Lys Asp Glu Glu Gly Lys Ala Tyr Gin Ser Gly Gly Cys 
405 410 415 

Gly Gly Phe Gly Lys Gly Asn Phe Ser Glu Leu Phe Lys Ser He Glu 
10 420 425 430 

Glu Tyr Glu Lys Thr Leu Glu Ala Lys Gin Leu Val Gly * 

435 440 445 
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5 CLAIMS 

I claim: 



1. A biologically pure sample of DNA, the DNA 
comprising a DNA sequence coding for the expression of 
a plant p-hydroxyphenyl pyruvic acid dioxygenase . 

io 2 . A vector containing the DNA sequence of 

claim 1. 

3. A microbial host transformed by the vector 
of claim 2 . 

4. The DNA of claim 1, wherein the p- 

15 hydroxyphenyl pyruvic acid dioxygenase is from 
Arabidopsis thaliana. 

5. A transgenic tomato plant transformed with a 
DNA construct including the DNA of claim 1. 

6 . A transgenic Arabidopsis plant transformed 
20 with a DNA construct including the DNA of claim 1 

7. The biologically pure DNA of claim 1 wherein 
the DNA is SEQ ID NO: 2. 

8. A DNA plant gene expression construct 
comprising : 

a . a DNA sequence coding for the 
expression of a plant p-hydroxyphenyl pyruvic acid 
dioxygenase ; 

b. a promoter effective in plant cells 
located 5' to the DNA coding sequence; and 

c. a 3' termination sequence effective in 
plant cells. 



25 
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9. A DNA construct comprising: 

a. a promoter capable of expressing a 
downstream coding sequence in a tomato plant; 

b. a DNA sequence coding for the 
5 expression of a p-hydroxyphenyl pyruvic acid 

dioxygenase of plant origin; and 

c. a 3' termination sequence, the 
construction capable of expressing a p-hydroxyphenyl 
pyruvic acid dioxygenase gene when transformed into 

10 tomato plants. 

10. A bacteria containing the construction of 
Claim 9. 

11. A tomato plant cell containing the 
construction of Claim 9. 

15 12 . An Arabidopsis plant cell containing the 

construction of Claim 9. 

13. A transgenic tomato plant comprising in its 
genome a foreign genetic construction comprising, 5' 
to 3', a promoter effective in tomato, a DNA coding 
20 region encoding p-hydroxyphenyl pyruvic acid 

dioxygenase, and a transcriptional terminator, the 
genetic construction effective in vivo in tomato 
plants to stimulate expression of p-hydroxyphenyl 
pyruvic acid dioxygenase. 

25 14. Seed of the tomato plant of claim 13. 

15. Fruit of the tomato plant of claim 13. 

16. The transgenic tomato plant of claim 11 
wherein the DNA coding region encoding p-hydroxyphenyl 
pyruvic acid dioxygenase is that set forth in SEQ ID 

30 NO:l. 



■35- 



BN8DOCID: <WO_9727286A1JU» 



WO 97/27285 




PCT/US97/01384 



17. A method of suppressing the production of 
vitamin E and plastoquinones in a plant, the method 
comprising the steps of : 

a. isolating a DNA sequence encoding a p- 
5 hydroxyphenyl pyruvic acid dioxygenase of plant 

origin; 

b. creating a genetic construction including, 

5' to 3', a promoter effective in the plant's cells, a 
coding sequence, and a transcriptional terminator, the 
10 coding region being derived from the DNA sequence, 
wherein the DNA sequence from step (a) has been 
altered so that expression of p- hydroxyphenyl pyruvic 
acid dioxygenase is suppressed; and 

c. transforming a cell of the plant with the 
is genetic construction, whereby the plant's cell 

produces lowered levels of vitamin E and 
plastoquinones. 
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5 CLONED PLANT P-HYDROXYPHENYL PYRUVIC ACID DIOXYGENASE 

This invention was made with Government support 
under Grant Number 933 73 069083 awarded by the U.S. 
Department of Agriculture. The Government has certain 
rights in the invention. 

10 Field Of The invent- -ion 

The present invention relates to a molecular 
approach for modifying the synthesis of vitamin E, 
plastoquinone, and carotenoids in plants by use of a 
full-length cloned cDNA which encodes a p- 

15 hydroxyphenyl pyruvic acid dioxygenase enzyme. 

Backornnnrj pf T hg Invenf ion 
The chloroplasts of higher plants contain many 
unique, interconnected biochemical pathways that 
produce an array of secondary metabolite compounds 
which not only perform vital functions within the 
plant but are also important from agricultural and 
nutritional perspectives. Three such secondary 
metabolites are the lipid soluble, chloroplastically 
synthesized compounds vitamin E (a- tocopherol or 
a-toc), plastoquinones (PQ) , and carotenoids, which 
together perform many crucial biochemical functions in 
the chloroplast. PQ and vitamin E are quinone 
compounds synthesized by a common pathway in the 
plastid; carotenoids are tetraterpenoids synthesized 
by a separate plast id-localized pathway. 

Plastoquinone (PQ) often accounts for up to 50% 
of the total plastidic quinone pool in green tissues. 
The primary function of PQ is as a fundamental 
component of the photosynthetic electron transport 
chain, acting as an electron carrier between 
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photosystem II and the cytochrome bgf complex. PQ 
likely has other less well studied functions in 
plastids, namely in acting as a direct or intermediate 
electron carrier for a variety of other biosynthetic 
5 reactions in the chloroplast . 

Vitamin E is the second major class of 
chloroplastic quinones, accounting for up to 40% of 
the quinone pool in plastids. The essential 
nutritional value of tocopherols was recognized around 
10 1925/ and the compound responsible for Vitamin E 

activity was first identified as of-tocopherol in 1936. 
cy-Toc has a well -documented role in mammals as an 
antioxidant, and a similar, though less well 
understood antioxidant role in plants. Liebler, et 
15 al., Toxicology 2 3 : 14 7-169, 1993; Hess, Ant i -oxidants 
in Higher Plants , CRC Press: 111-134, 1993. 

Carotenoids are a separate, diverse group of 
lipophilic pigments synthesized in plants, fungi, and 
bacteria. In photosynthetic tissues, carotenoids 
20 function as accessory pigments in light harvesting and 
play important roles in photo-protection by quenching 
free radicals, singlet oxygen, and other reactive 
species. Sief ermann-Harms , Physiol . Plantarum . 69:561- 
568, 1987. In the plastids of non-photosynthetic 
25 tissues, high levels of carotenoids often accumulate 
providing the intense orange, yellow, and red 
coloration of many fruits, vegetables, and flowers 
(Pfander, Methods in Enzvm. , 213A, 3-13, 1992). In 
addition to their many functions in plants, 
30 carotenoids and their metabolites also have important 
functions in animals, where they serve as the major 
source of Vitamin A (retinol) , and have been 
identified as providing protection from some forms of 
cancer due to their antioxidant activities. Vitamin 
35 E's antioxidant activities are also thought to protect 
against some forms of cancer, and may act 
synergistically with carotenoids in this regard. 
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Liebler, ^^1., Toxicology 23:147-1^^ 1993 ; Krinsky, 
J. Nutr. 119:123-126, 1989. 

Tocopherol and Plastoquinone Synthesis 

o?-Tocopherol and plastoquinone are the most 
5 abundant quinones in the plastid and are synthesized 
by the common pathway shown in Figure 1 . The 
precursor molecule for both compounds, homogentisic 
acid (HGA) , is produced in the chloroplast from the 
shikimic acid pathway intermediate p-hydroxyphenyl 
10 pyruvic acid (pOHPP) , in an oxidat ion/decarboxylat ion 
reaction catalyzed by the enzyme p-hydroxyphenyl 
pyruvic acid dioxygenase (pOHPP dioxygenase) . 
Homogentisic acid is subject to 

phytylation/prenylation (phytylpyrophosphate and 
15 solanylpyrophosphate, C 20 and C 45 , respectively) coupled 
to a simultaneous decarboxylation by a phytyl/prenyl 
transferase to form the first true tocopherol and 
plastoquinone intermediates , 
2-demethylphytylplastoquinol and 
20 2-demethylplastoquinol-9 , respectively. A single ring 
methylation occurs on 2 -demethylplastoquinol to yield 
plastoquinol-9 (PQH 2 ) which is then oxidized to 
plastoquinone- 9 (PQ) . This oxidation is reversible 
and is the basis of electron transport by 

2 5 plastoquinone in the chloroplast. 

The preferred route, as established in spinach, 
for or-tocopherol formation from 

2-demethylphytylplastoquinol appears to be 1) ring 
methylation of the intermediate, 2-a- 

3 0 demethylphytylplastoquinol, to yield 

phytylplastoquinol, 2) cyclization to yield 
d- tocopherol and, finally, 3) a second ring 
methylation to yield or- tocopherol . Ring methylation 
in both tocopherol and plastoquinone synthesis is 
carried out by a single enzyme that is specific for 
the site of methylation on the ring, but has 



-3 



35 



BN8DOCID: <WO_97Z7286A1 _JA> 



WO 97/27285 



PCT7US97/0I384 




10 



15 



20 



25 



30 



relatively broad substrate specificity and 
accommodates both classes of quinone compounds. This 
methylation enzyme is the only enzyme of the pathway 
that has been purified from plants to date. 
d'Harlingue, et al . , J.Biol.Chem. 26:15200, 1985. All 
enzymatic activities of the a-toc/PQ pathway have been 
localized to the inner chloroplast envelope by cell 
fractionation studies except for pOHPP dioxygenase and 
the tocopherol cyclase enzyme. Difficulties with cell 
fractionation methods, low activities for some of the 
enzymes, substrate stability and availability and 
assay problems, make studying the pathway 
biochemically difficult. 

Vitamin E and PQ levels, ratios, and total 
amounts vary by orders of magnitude in different 
plants, tissues and developmental stages. Such 
variations indicate that the vitamin E and PQ pathway 
is both highly regulated and has the potential for 
manipulation to modify the absolute levels and ratios 
of the two end products. The pathway in Figure 1 
makes it clear that production of homogentisic acid by 
pOHPP dioxygenase is likely to be a key regulatory 
point for bulk flow through the pathway, both because 
HGA production is the first committed step in a-toc/PQ 
synthesis, and also because the reaction is 
essentially irreversible. Therefore modifying the 
levels of HGA by modifying pOHPP dioxygenase activity 
should have a direct impact on the total ar-toc/PQ 
biosynthetic accumulation in plant tissues, and, as 
described below, because of the connection of PQ and 
carotenoid synthesis, should also affect carotenoid 
synthesis in plant tissues. 

Carotenoid Biosynthesis; Quinones as Electron Carriers 

In plants, carotenoids are synthesized and 
accumulate exclusively in plastids via the pathway 
shown on the left-hand side of Figure 1. The first 
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committed ^^^p in carotenoid synthesizes the 

condensation of two molecules of the C 20 hydrocarbon 
geranylgeranyl pyrophosphate (GGDP) by the enzyme 
phytoene synthase, to form the colorless C 40 
5 hydrocarbon, phytoene. In oxygenic photosynthetic 
organisms (e.g. plants, algae, and cyanobacteria) , 
phytoene undergoes two sequential desaturation 
reactions, catalyzed by phytoene desaturase, to 
produce f-carotene through the intermediate 
10 phytofluene. Subsequently, f -carotene undergoes two 
further desaturat ions , catalyzed by f-carotene 
desaturase, to yield the red pigment lycopene . 
Lycopene is cyclized to produce either ar-carotene or 
^-carotene, both of which are subject to various 
15 hydroxylation and epoxidation reactions to yield the 
carotenoids and xanthophylls most abundant in 
photosynthetic tissues of plants, lutein, /3- carotene, 
violaxanthin and neoxanthin. 

The genes encoding the first two enzymes of the 
20 carotenoid pathway (phytoene synthase and phytoene 
desaturase) have been isolated and studied from a 
number of plant and bacterial sources in recent years. 
Sandmann, Eur. J. Biochem. 223:7-24, 1994. Phytoene 
desaturase has been the most intensively studied, both 
25 because it is a target for numerous commercially 

important herbicides, and also because the phytoene 
desaturation reaction is thought to be a rate limiting 
step in carotenoid synthesis. Molecular and 
biochemical studies suggest that two types of phytoene 
3 0 desaturase enzymes have evolved by independent 
evolution: the cr tT- type found in anoxygenic 
photosynthetic organisms (e.g. RhodoJbacter and 
Erwinia) , and the pds-type found in oxygenic 
photosynthetic organisms. Despite their differences 
35 in primary amino acid sequence, all phytoene 

desaturase enzymes contain a dinucleotide binding 
domain (FAD or NAD/NADP) , which in Capsicum annum has 
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been shown to be FAD. Hugueney et al . , Eur. J . 
Biochem. 209 : 399-407, 1992 . Presumably, the bound 
dinucleotide in both types of phytoene desaturase 
enzymes is reduced during desaturation and reoxidized 
by an unknown reductant present in the plastid or 
bacterium. 

Several lines of evidence have suggested a role 
for quinones in the phytoene desaturation reaction in 
higher plants. Using isolated daffodil chromoplasts , 
Mayer and co-workers demonstrated that in an anaerobic 
environment, oxidized artificial quinones were 
required for the desaturation of phytoene while 
reduced quinones were ineffective. Mayer et al . , Eur . 
J. Biochem. 191:359-363, 1990. Further supporting 
evidence comes from studies with the triketone class 
of herbicides (e.g. Sulcotrione) , which cause phytoene 
accumulation in treated tissues but unlike the well- 
studied pyridazone class (e.g. Norf lorazon (NFZ) ) do 
not directly affect the phytoene desaturase enzyme. 
Rather, triketone herbicides competitively inhibit 
pOHPP dioxygenase, an enzyme common to the synthesis 
of both plastoquinone and tocopherols, suggesting that 
one or more classes of quinones may play a role in 
carotenoid desaturation reactions. Schulz et al . , 
FEBS 318:162-166, 1993; Secor, Plant Phvsiol . 106: 
1429-1433; Beyer et al . , IUPAC Pure and Applied 
Chemistry 66:1047-1056, 1994. 

Despite the well -studied , wide-spread importance 
of vitamin E, plastoquinone, and carotenoids to human 
nutrition, agriculture, and biochemical processes 
within plant cells, much remains unclear about their 
biosynthesis and accumulation in plant tissues. This 
uncertainty has in turn limited the potential for 
manipulation of the synthesis and levels of these 
important compounds in plants. 
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Summary of the Invention 
In one embodiment, this invention provides a 
biologically pure sample of DNA which DNA comprises a 
sequence coding for the expression of Arabidopsis 
thaliana p-hydroxyphenyl pyruvic acid dioxygenase. 

In other embodiments, this invention provides a 
vector and microbial host containing a DNA sequence 
sufficiently homologous to SEQ ID NO:l so as to code 
for the expression of Arabidopsis thaliana p- 
hydroxyphenyl pyruvic acid dioxygenase, and a genetic 
construct containing a DNA sequence sufficiently 
homologous to SEQ ID NO : 1 so as to code for the 
expression of Arabidopsis thaliana p-hydroxyphenyl 
pyruvic acid dioxygenase, together with a promoter 
located 5' to the DNA coding sequence and a 3' 
termination sequence. 

In another embodiment, this invention provides a 
method of creating a transgenic plant in which the 
levels of the pOHPP dioxygenase enzyme are elevated 
sufficient such that production of plastoquinones , 
vitamin E, and carotenoids are modified. 

It is an object of the present invention to 
genetically engineer higher plants to modify the 
production of plastoquinones, vitamin E, and 
carotenoids. 

It is another object of the invention to provide 
transgenic plants that would express elevated levels 
of the pOHPP dioxygenase enzyme which would have 
resultant elevated resistance to the triketone class 
of herbicides (i.e. sulcotrione) . 

It is another object of the present invention to 
provide a method for the preparation of the enzyme p- 
hydroxyphenyl pyruvic acid dioxygenase (pOHPP 
dioxygenase) , an enzyme which can be used to identify 
new pOHPPdioxygenase- inhibiting herbicides. 

Other features and advantages of the invention 



will be apparent from the following description of the 
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preferred embodiments thereof and from the claims. 



Brief Description of the Drawings 
Fig. 1 is a diagram of the pathways for synthesis 
of carotenoids, vitamin E (tocopherol) , and 
5 plastoquinone . 

Fig. 2 is a diagram of the interconnections of 
the pathways illustrated in Fig. 1. 

Fig. 3A-3E are graphs of pigment analyses of 
wild-type, NFZ-wt, and pdsl tissues. 
10 Fig. 4 is a physical map of the pdsl mutation 

relative to visible markers. 

Figs. 5A-5C present the results of C18 HPLC 
separation of lipid soluble pigments from wild-type 
plants on MS 2 media, homozygous pdsl mutants on MS 2 
15 media supplemented with pOHPP, and homozygous pdsl 

mutants on MS2 media supplemented with homogentistic 
acid (HGA) . 

Figs. 6A-6B present the results of C8 HPLC 
analyses of quinones in NFZ-wt and pdsl tissues. 

Detailed Description Of The Invention 
As described above, both Vitamin E, 
plastoquinones and carotenoids are synthesized and 
accumulated in plastids by the pathways shown in 
Figure 1. This specification describes the 
identification, isolation, characterization and 
functional analysis of a higher plant pOHPP 
dioxygenase cDNA, its role in a- toe, PQ and carotenoid 
synthesis, and the use of this cDNA to modify pOHPP 
dioxygenase activity in plant tissues and hence the 
accumulation of one or more of the compounds 
plastoquinones, vitamin E, and carotenoids in plant 
tissues. The overexpression of pOHPP dioxygenase in 
transgenic plants will modify the enzyme- to- inhibitor 
ratio of plant tissues exposed to triketone 
herbicides, as compared to non- transgenic plants, 
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• m 
increased herbicide resistance. The 

present specification also describes a genetic 
construct for use in the production of pOHPP 
dioxygenase, an enzyme useful in identifying new pOHPP 
5 dioxygenase- inhibiting herbicides. 

By genetic analysis the present inventors have 
shown that the vitamin E, plastoquinone , and 
carotenoid biosynthetic pathways are interconnected 
and share common elements as shown in Figure 2. From 
10 mutational studies in Arabidopsis thaliana, the 
present inventors identified one genetic locus, 
designated pdsl (pds= phytoene desaturation) , the 
disruption of which results in accumulation of the 
first carotenoid of the carotenoid biosynthetic 
15 . pathway, phytoene. Surprisingly, though this mutation 
disrupts carotenoid synthesis and was originally 
identified on this basis, it does not map to the locus 
encoding the phytoene desaturase enzyme . Evidence 
indicates that pdsl defines a second gene product in 
20 addition to the phytoene desaturase enzyme, necessary 
for phytoene desaturation and hence carotenoid 
synthesis in higher plants. This gene product proved 
to be pOHPP dioxygenase . 

To provide a molecular mechanism for manipulating 
25 synthesis and accumulation of the compounds 

plastoquinone, vitamin E, and carotenoids, the present 
inventors used a molecular genetic approach, taking 
advantage of the model plant system Arabidopsis 
thaliana to define, isolate and study genes required 
for synthesis of the compounds in plants. The 
flowering plant Arabidopsis thaliana has come into 
wide use as a model system to explore the molecular 
biology and genetics of plants. Arabidopsis offers 
many advantages for genetic analysis: it can be selfed 
and very large numbers of progeny can be obtained (up 
to 10,000 seeds from a single plant). Furthermore, 
Arabidopsis has a short generation time of five to six 
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weeks, so crosses can be set up and the progeny 
analyzed within reasonable periods of time. Mutation 
screens have identified thousands of mutations 
affecting many aspects of basic plant biology, 
5 including morphogenesis, photosynthesis, fertility, 

starch and lipid metabolism, mineral nutrition, an so 
on. In addition, its haploid genome is only about 10 8 
base pairs. 

An important aspect of the successful approach 

10 used here is that essential components were first 
functionally defined genetically, prior to their 
isolation, analysis and molecular manipulation. 
Briefly, potential mutants were identified by a 
combination of phenotypic and biochemical screening, 

15 characterized at the genetic and molecular levels, 

loci of interest selected, and the corresponding genes 
then cloned and studied further. By this approach, 
the inventors genetically defined and isolated cDNAs 
for one gene, pdsl, whose mutation disrupts synthesis 

20 of all three classes of compounds in the plastid, 

tocopherols, plastoquinones and carotenoids. Based on 
biochemical analysis of the pdsl mutant, the pdsl gene 
was identified as affecting the activity of pOHPP 
dioxygenase, a crucial enzyme of the plastidic quinone 

25 pathway in plants (Figure 1) , that is directly 
required for the synthesis of plastoquinone and 
a- tocopherol and indirectly f or carotenoid synthesis. 
In particular, the deduced function of the pdsl mutant 
and pOHPP dioxygenase enzyme are noted in Figure 2 . 

3 0 The present inventors demonstrated by biochemical 

complementation that the pdsl mutation affects the 
enzyme p-hydroxyphenyl pyruvic acid dioxygenase (pOHPP 
dioxygenase) , because pdsl plants can be rescued by 
growth on the product but not the substrate of this 

3 5 enzyme, homogentisic acid (HGA) and 

p-hydroxyphenylpyruvate (pOHPP) , respectively. pOHPP 
dioxygenase is the key branch point enzyme and 



10- 



BN80OCID: <WO_fl72728SA1JA> 



WO 97/27285 



PCT/US97/01384 



committed 




p in the synthesis of bo 




Vitamin E and 



10 




15 



20 



25 



30 



plastoquinones and several independent lines of 
biochemical evidence confirm pdsl affects this enzyme 
(Figures 1, 5, 6) . These results provide the first 
genetic evidence that plastoquinones are essential 
components for carotenoid synthesis in higher plants, 
most likely as an electron carrier/redox element in 
the desaturation reaction (Figure 2) . The Arabidopsis 
pOHPP dioxygenase gene/cDNA thus provides a basis for 
modifying the production of plastoquinones, 
a- tocopherol and carotenoids in all higher plants. 

Specifically, the specification describes the 
genetic identification of the Arabidopsis pOHPP 
dioxygenase gene by mutational analysis, the physical 
isolation and functional confirmation of an 
Arabidopsis pOHPP dioxygenase cDNA, its nucleotide 
sequence and its use to isolate pOHPP dioxygenase 
genes and cDNAs from other plant species. Also 
included in the specification is a description of the 
use of the Arabidopsis pOHPP dioxygenase cDNA, and 
related cDNAs from other plants, to positively or 
negatively modify the expression/activity of pOHPP 
dioxygenase by recombinant techniques (overexpression, 
cosuppression, antisense, etc.) in any and all plant 
tissues, especially leaf and fruit tissues, to 
positively or negatively affect the production of 
<*-toc, PQ and carotenoids. 

Elevating pOHPP dioxygenase protein levels 
increases the amount of homogentisic acid (HGA) 
synthesized in plant tissues. Because HGA is the 
limiting precursor molecule for cr-toc. and PQ synthesis 
(the end products of the pathway), increasing HGA 
synthesis increases the levels of cr-toc (Vitamin E) 
and PQ in plant tissues. The increase in PQ 
indirectly increases the synthesis of carotenoids, 
which require PQ for their synthesis. In addition, 
the increase in PQ increases photosynthet ic efficiency 



-11- 



BN8DOGD: <WO_9727286A1JA> 



WO 97/27285 



PCT/US97/01384 



10 



15 



20 



25 



30 



by increasing electron flow between "photosystem II and 
photosystem I, because PQ is the primary electron 
transporter between the two photosystems . The 
increase in Qf-toc, a well -studied antioxidant in 
mammals, increases the ability of plants to withstand 
oxidative stresses, such as that caused by high light, 
high temperature, water stress, ozone stress, UV 
stress or other abiotic or biotic stresses. Elevating 
the levels of pOHPP dioxygenase will modify the dose 
response curve of herbicides targeting pOHPP 
dioxygenase, thus increasing the relative resistance 
to such herbicides in transgenic plants as compared to 
native plants of the same species. Inhibiting the 
expression of pOHPP dioxygenase is expected to have 
the opposite effect. 

Genetic Construct 

To express pOHPP dioxygenase in a plant, it is 
required that a DNA sequence containing the pOHPP 
dioxygenase coding sequence be combined with 
regulatory sequences capable of expressing the coding 
sequence in a plant. A number of effective plant 
promoters, both constitutive and developmentally or 
tissue specific, are known to those of skill in the 
art . A transcriptional termination sequence 
(polyadenylation sequence) may also be added. Plant 
expression vectors, or plasmids constructed for 
expression of inserted coding sequences in plants, are 
widely used in the art to assemble chimeric plant 
expression constructs including the coding sequence, 
and to conveniently transfer the constructs into 
plants. A sequence which codes for pOHPP dioxygenase 
includes, for example, SEQ ID NO : 1 , or versions of the 
designated sequence sufficient to effect coding for 
the expression of pOHPP dioxygenase. Commonly used 
methods of molecular biology well-known to those of 
skill in the art may be used to manipulate the DNA 
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sequences.^ 

By "genetic construct" we mean any of a variety 
of ways of combining the protein-encoding sequences 
with a promoter sequence (and termination sequence, if 
necessary) in a manner that operably connects the 
promoter sequence (and termination sequence, if 
present) with the protein-encoding sequences. 
Typically, the promoter sequence will be "upstream" of 
a protein-encoding sequence, while the termination 
sequence, if used, will be "downstream" of the 
protein-encoding sequences . 

The protein-encoding, promoter and termination 
sequences may be combined on a plasmid or viral 
vector, and inserted into a microbial host. Other 
functional sequences may be added to the gene 
construct. Alternatively, the protein-encoding, 
promoter, and termination sequence, if added, may be 
combined with any other needed functional sequences 
and used without a vector. 

The DNA sequence described by SEQ ID NO:l is 
sufficient to effect coding for the expression of 
pOHPP dioxygenase. However, it is envisioned that the 
above sequence could be truncated and still confer the 
same properties. It is not known at present which 
specific deletions would be successful, but it is 
likely that some deletions to the protein would still 
result in effective enzymatic activity. One skilled 
in the art of molecular biology would be able to take 
the designated sequence and perform deletional 
analysis experiments to determine what portions of the 
designated sequence are essential to effect coding for 
the expression of pOHPP dioxygenase . One could create 
a genetic construct with the candidate deletion 
mutations and perform experiments as described below 
in the Examples, to test whether such deletion 
mutation sequences effect coding for the enzyme. 
Expression of the enzyme activity indicates a 
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successful deletion mutant or mutants. In this 
manner, one could determine which parts of the 
designated sequence is essential for expression of the 
enzyme . 

5 It is also known that the genetic code is 

degenerate, meaning that more than one codon, or set 
of three nucleotides, codes for each amino acid. Thus 
it is possible to alter the DNA coding sequence to a 
protein, such as the sequence for pOHPP dioxygenase 

10 described here, without altering the sequence of the 

protein produced. Selection of codon usage may affect 
expression level in a particular host. Such changes 
in codon usage are also contemplated here. 

It is further contemplated that using the 

15 Arabidopsis pOHPP gene coding sequence described here, 
that the homologous pOHPP dioxygenase sequences from 
other higher plants can be readily recovered. 
Oligonucleotides can, be made from the sequence set 
forth below to either hybridize against cDNA or 

20 genomic libraries or used for PCR amplification of 
homologous pOHPP dioxygenase sequences from other 
plants . 

Once a pOHPP gene is in hand, whether from 
Arabidopsis or from some other plant species, it then 

25 becomes possible to insert a chimeric plant expression 
genetic construct into any plant species of interest. 
Suitable plant transformation methods exist to insert 
such genetic constructs into most, if not all, 
commercially important plant species. Presently known 

30 methods include Agrobacterium-mediated transformation, 
coated-particle gene delivery (Biolistics) and 
electroporat ion, in which an electric voltage is used 
to facilitate gene insertion. All these methods, and 
others, can insert the genetic construct into the 

35 genome of the resulting transgenic plant in such a way 
that the genetic construct becomes an inheritable 
trait, transmitted to progeny of the original 
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inheritance. Thus, once a genetic construct 
expressing a pOHPP gene is inserted into a plant, it 
can become a part of a plant breeding program for 
transfer into any desired genetic background. 

To over-express pOHPP dioxygenase, a genetic 
construct may be used with a higher strength promoter. 
To inhibit expression of endogenous pOHPP dioxygenase, 
an antisense genetic construct can be made, as is 
known by those of skill in the art, to reduce the 
level of pOHPP dioxygenase present in the plant 
tissues. 



Isolation of pdal, a mutant defective in carotenoid 
synthesis 

To further understand carotenoid biosynthesis and 
its integration with other pathways in the chloroplast 
in higher plants, the present inventors studied the 
pathway by isolating Arabidopsis thaliana mutants 
that are blocked in carotenoid synthesis. 

Plants homozygous for defects in the early stages 
of carotenoid synthesis (e.g. prior to production of 
/3-carotene) are lethal when grown in soil and the 
isolation of such mutations requires the design of 
screening procedures to identify plants heterozygous 
for soil lethal mutations. The present inventors 
found that most soil lethal, homozygous 
pigment -deficient Arabidopsis mutants can be grown to 
near maturity in tissue culture on Murashige and Skoog 
basal media (Murashige and Skoog, Phvsiol . Plant. 
15:473-497, 1962) supplemented with sucrose (MS 2 
media) . Under these conditions, photosynthesis and 
chloroplast development are essentially dispensable 
and all the energy and nutritional needs of the plant 
are supplied by the media. 

Greater than 500 lines from the 10,000 member 
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Feldmann T-DNA tagged Arabidapsis thaliana population 
(Forsthoefel et al . , Aust . J. Plant Phvsiol . 19:353- 
366, 1992) were selected for pigment analysis based on 
their segregation for lethal pigment mutations. Seed 
from plants heterozygous for lethal pigment mutations 
were surf ace sterilized, grown on MS 2 media, the 
segregating pigment mutants identified, tissue 
harvested from individual plants, and HPLC pigment 
analysis performed. Although numerous mutant lines 
with severe pigment deficiencies were identified, only 
two were found to be carotenoid biosynthetic mutants. 
One mutant line isolated from this group, pdsl, is 
described in detail here. 

The hallmark phenotype for disruption of a 
biosynthetic pathway is the accumulation of an 
intermediate compound prior to the site of blockage. 
Such blockage of the carotenoid pathway can be 
mimicked chemically by treatment of wild- type plants 
with the herbicide NFZ , an inhibitor of the phytoene 
desaturase enzyme (Figure 1) which has been reported 
to cause accumulation of phytoene in treated tissues. 
Britton, Z. Naturforsch 34c : 9-79 - 985 , 1979. Figs. 3A- 
3E present the results of pigment analysis of wild- 
type, NFZ-wt, and pdsl tissues. Abbreviations in 
Figs. 3A-3E are as follows: N, neoxanthin; V, 
violaxanthin; L, lutein; Cb, chlorophyll b ; Ca , 
chlorophyll a; /3, /8-carotene. 

Figure 3A shows C 18 Reverse Phase HPLC analysis of 
the carotenoids that accumulate in wild- type 
Arabidopsis thaliana leaves. In comparison, Figure 3B 
shows the pigment profile for NFZ treated wild- type 
(NFZ-Wt). Spectral analysis of the strongly absorbing 
296nm peak at 33 minutes in NFZ-Wt tissue shows 
absorbance maxima at 276, 286, and 298nm, indicative 
of phytoene (Figure 3D) . Figure 3C shows pigment 
analysis of tissue culture grown homozygous pdsl 
mutant plants. The low absorbance at 44 0nm in Figures 
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3B and C demonstrates that like NFZ-Wt, pdsl mutants 
lack all chlorophylls and carotenoids that normally 
accumulate in wild-type tissue (compare to Figure 3A) . 
However, unlike wild- type, pdsl mutants contain a peak 



absorbs strongly at 296nm. The retention time and 
absorbance of the 3 3 -minute peak in the pdsl mutant 
corresponds to the phytoene peak in pigment extracts 
of NFZ-Wt tissue (Figure 3B) . . Spectral analysis of 
the 33 -minute peak from pdsl is shown in Figure 3E and 
is virtually identical to the spectra of phytoene from 
NFZ-Wt tissue (Figure 3D) as well as to the published 
spectra for phytoene. These results confirm the 
chemical identity of the accumulating compound in pdsl 
as phytoene and conclusively demonstrate that the pdsl 
mutation disrupts carotenoid biosynthesis. 

Carotenoid Analysis 

For quantitative and qualitative carotenoid 
analysis, plant tissue is placed in a microfuge tube 
and ground with a micropestle in 200/xl of 80% acetone. 
120/21 of ethyl acetate is added and the mixture 
vortexed. 14 0/il of water is added and the mixture 
centrifuged for 5 minutes. The carotenoid containing, 
upper phase is then transferred to a fresh tube and 
vacuum dried in a Jouan RC1010 Centrifugal Evaporator. 
The dried extract is resuspended in ethyl acetate at a 
concentration of 0 . 5mg fresh weight of tissue per jil 
and either analyzed immediately by HPLC or stored at 
-80*C under nitrogen. 

Carotenoids were separated by reverse -phase HPLC 
analysis on a Spherisorb ODS2 5 micron C 18 column, 25 
cm in length (Phase Separations Limited, Norwalk, CT) 
using a 45 minute gradient of Ethyl Acetate (0-100%) 
in Acetonitrile/water/triethylamine (9:1:0.01 v/v) , at 
a flow rate of l ml per minute (Goodwin and Brit ton, 
1988) . Carotenoids were identified by retention time 



with a retention time at approximately 33 minutes that 
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relative to known standards with detection at both 
296nm and 440nm. When needed, absorption spectra for 
individual peaks were obtained with a Hewlett Packard 
104 OA photodiode array detector and compared with 
published spectra or available standards. 



Quinone analysis 

Quinones were extracted from tissue using a 
method modified from that described in Bligh et al . , 
Can. J. Biochem. Physiol. 37:911-917, 1959. Frozen 

10 plant tissue was ground in a mortar with 3 volumes of 
chloroform and 6 volumes of methanol and transferred 
to a test tube. . Water and additional chloroform were 
added until a biphasic mixture was obtained. The 
quinone containing chloroform phase was then 

15 collected. To increase yields, the aqueous phase was 
back-extracted with chloroform, the two chloroform 
phases pooled, and then filtered through Whatman #3 
filter paper. The resulting filtrate was dried under 
a constant stream of nitrogen. Once dried, the pellet 

2 0 was resuspended in methanol at a concentration of lOmg 
fresh weight per ml and immediately analyzed by HPLC. 
Quinones were resolved by reversed-phase HPLC analysis 
on a LiChrosorb RP-8, 5 micron column, 25cm in length, 
(Alltech, San Jose, CA) using an isocratic solvent of 

25 10% H 2 0 in Methanol for the first 14 minutes, at which 
time the solvent was switched to 100% methanol for the 
remainder of the run (modified from the method 
described in Lichtenthaler , Handbook of 1 
Chromatography * CRC Press, 115-159, 1984) . The flow 

30 rate was 1ml per minute for the duration. Peaks were 
identified based upon the retention time of known 
standards with detection at 280nm for or- tocopherol and 
260nm for plastoquinone and ubiquinone as well as by 
absorption spectra from a Hewlett Packard 104 OA 

35 photodiode array detector. When needed, fractions 

represented by individual chromatographic peaks were 
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collected,^^d submitted to the South^^^t 

Environmental Health Science Center, Analytical Core 
laboratory for mass spectral analysis. Results were 
obtained using a TSQ7000 tandem mass spectrometer 
5 (Finnigan Corp,, San Jose, CA) equipped with an 

atmospheric pressure chemical ionization source 
operated in the positive ion mode. The instrument was 
set to unit resolution and the samples were introduced 
into. the source in a 0 . 3 ml/minute methanol stream and 
10 ionized using a 5kV discharge. 

Genetic analysis of pdsl 

The genetic nature of the pdsl mutation was 
determined by analyzing seeds resulting from selfing 
pdsl heterozygous plants. Prior to desiccation, Fl 

15 seeds were scored as either green (wild-type or 
heterozygous) or white (homozygous). A 3:1 
segregation ratio was observed (146 green seeds: 48 
white seeds) , indicating that pdsl is inherited as 
single recessive nuclear mutations (X 2 =0.01, p > 0.90). 

20 Because pdsl mutants are inhibited in the desaturation 
of phytoene, the inventors believed that it might be. a 
mutation in the phytoene desaturase enzyme, which had 
previously been mapped to chromosome 4, between ag and 
bp. Wetzel et al . , Plant J. 6:161-175, 1994. To test 

25 this hypothesis, the pdsl mutation was mapped relative 
to visible markers. The pdsl mutation was found to 
map to chromosome 1, approximately 7 cM from disl 
toward clv2. Franzmann et al . , Plant J. 7:341-350, 
1995. These data points are summarized in Figure 4 

30 and establish that pdsl does not map to the phytoene 
desaturase enzyme locus, thus proving that the pdsl 
mutation is not in the phytoene desaturase enzyme. 
This data provided important insight for 
characterization of the pdsl mutant . 
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Homozygous pdsl mutants can be rescued by Homogentisic 
Acid, an intermediate in plastoquinone and tocopherol 
biosynthesis 

As described earlier, previous research 
5 suggesting a role for quinones and pOHPP dioxygenase 

in phytoene desaturation lead the present inventors to 
investigate the quinone biosynthetic pathway in the 
pdsl mutant. The early stages of 

plastoquinone/ tocopherol synthesis were functionally 
10 analyzed by growth in the presence of two intermediate 
compounds in the pathway, p-hydroxyphenylpyruvate 
(pOHPP) and homogentisic acid (HGA) (refer to Figures 
1 and 2) . Albino pdsl homozygous plants were first 
germinated on MS2 media and then transferred to MS2 
15 media supplemented with IOOjxM of either pOHPP or HGA. 

pdsl plants remained albino when transferred to media 
containing pOHPP but greening occurred when pdsl 
plants were transferred to media containing HGA. 
Figs. 5A-5C present the results of complementation of 
20 the pdsl mutation with homogentisic acid. Each 

profile represents pigments extracted from lOmg fresh 
weight of tissue. Abbreviations used in Figs. 5A-5C 
are as described in Figs. 3A-3E. HPLC analysis with 
detection at 440nm of the carotenoids extracted from 

2 5 pdsl plants grown on pOHPP and HGA are shown in 

Figures 5B and C, respectively. The pigment profiles 
of pdsl mutants grown on pOHPP are similar to the 
profiles of pdsl plants grown on MS2 media shown in 
Figure 3B. Comparison of the pigment profiles for 
30 pdsl + HGA tissue and wild-type tissue (Figures 5A and 
5C) indicates that growth in the presence of HGA is 
able to qualitatively restore a wild- type carotenoid 
profile to albino, homozygous pdsl plants. These 
results indicate that the pdsl mutation affects the 

3 5 enzyme pOHPP dioxygenase, because pdsl mutants are not 

altered by growth on the substrate of this enzyme, 
pOHPP, but rather, are restored qualitatively to 
wild- type pigmentation by growth on the product of 
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this enzyme, HGA (refer to Figures 1 and 2) . The 
complementation of pdsl with HGA also indicates that 
intermediates or end products of this pathway 
(plastoquinone and/or tocopherols, refer to Figures 1 
and 2) are necessary components for phytoene 
desaturation in plants and confirms the observation of 
Schultz et al, in FEBS where inhibitors of pOHPP 
dioxygenase were shown to cause accumulation of 
phytoene . 

HPLG analysis conclusively demonstrates that pdsl is a 
mutation in the plastoquinone/ tocopherol biosynthetic 
pathway that also affects carotenoid synthesis 

In addition to biochemical complementation of 
pdsl mutants, the plastoquinone/tocopherol pathway was 
also directly analyzed in pdsl tissue by utilizing C a 
HPLC to resolve total lipid extracts and identify 
three separate classes of quinones : ubiquinone, 
plastoquinone, and a- tocopherol (Vitamin E) (Figures 5 
and 6) . Ubiquinone and plastoquinone perform 
analogous electron transport functions in the 
mitochondria and chloroplast, respectively, but are 
synthesized by different pathways in separate 
subcellular compartments (Goodwin et al . , Introduction 
to Plant Biochemistry . Oxford, Pergamon Press, 1983), 
making ubiquinone an ideal internal control in these 
analyses. Figure 6 shows the C e HPLC analysis of lipid 
soluble extracts from NFZ-Wt tissue and pdsl tissue. 
In NFZ-Wt tissue (Figure 6A) , peaks 3 and 4 were 
identified as ubiquinone and plastoquinone, 
respectively, based on retention time (26 and 27 
minutes) , optical spectra, and mass spectra (results 
not shown) . NFZ-Wt tissue contained a peak (l) with a 
retention time of 13.5 minutes which was identified as 
or- tocopherol based upon the retention time of a 
standard. However, optical spectroscopy and mass 
spectrometry demonstrated that peak l was composed of 
two major components: a- tocopherol (la) and an 
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unidentified compound (lb) . The mass of a- tocopherol 
was determined to be 43 0 as indicated by the presence 
of the 431 protonated molecule while the molecular 
mass of the unidentified compound was 412, as 
indicated by the presence of the 413 protonated 
molecule (data not shown) , clearly demonstrating the 
presence of two compound in peak 1. This quinone 
analysis demonstrates that the herbicide NFZ, which 
specifically inhibits the phytoene desaturase enzyme, 
does not affect synthesis of homogentisate derived 
quinones. pdsl tissue (Figure 6B) contain ubiquinone 
(peak 3) but lack plastoquinone (peak 4) . 
Additionally, though pdsl contains a peak at 13.5 
minutes, optical spectroscopy and mass spectrometry 
data demonstrate that this peak lacks a-tocopherol 
(la) and is composed solely of the compound lb (data 
not shown) . Therefore, homozygous pdsl plants 
accumulate ubiquinone but lack both plastoquinone and 
af-tocopherol . This is consistent with the pdsl 
mutation affecting pOHPP dioxygenase (refer to Figures 
1 and 2) , as suggested by the rescue of the mutation 
by HGA, and provide additional evidence that the pdsl 
mutation disrupts pOHPP dioxygenase. 

Isolation of a truncated, putative pOHPP dioxygenase 
Arabi dopa i s cDNA 

The observation of Schultz et al . demonstrating 
that inhibitors of pOHPP dioxygenase activity disrupt 
carotenoid synthesis and cause accumulation of 
phytoene provided important insight for the 
characterization of the pdsl mutant which in turn 
provided the present inventors with important insight 
for the isolation of a putative cDNA for the pdsl 
locus. In animals, genetic defects which inhibit the 
activity of pOHPP dioxygenase lead to tyrosinemia type 
I, a fatal inherited disease in aromatic amino acid 
catabolism characterized by the presence of high 
levels of pOHPP in the urine. 
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In an^ffort to further understand the nature of 
this disease, pOHPP dioxygenase cDNAs have been cloned 
from several mammalian and bacterial sources 
(summarized in Ruetschi et al., Eur. J. Biochem. 



various mammalian pOHPP dioxygenase enzymes is >80%; 
in comparison, their identity to bacterial homologs is 
very low, less than 28%. By using mammalian and 
bacterial sequences to search the Expressed Sequence 
Tags (ESTs) computer DNA database (Newman et al., 
Plant Phvsiol. 106:1241-1255, 1994), one partial 
length Arabidopsis EST was identified and used as a 
probe. The partial length Arabidopsis probe 
corresponds to base pairs 1072 through 1500 of SEQ ID 
NO:l. 

This cDNA contained only 99 amino acids of the 
carboxyl terminal portion of the protein coding 
region. The deduced protein sequence of this putative 
Arabidopsis pOHPP dioxygenase cDNA shows similar 
homology (-50% identity) to both the mammalian and 
bacterial pOHPP dioxygenases . Interestingly, the 
partial Arabidopsis sequence also contains a 15 amino 
acid insertion not found in the human or bacterial 
enzymes. Finally, alignment of six pOHPP dioxygenase 
sequences from mammals and bacteria identified three 
regions of high conservation, the highest being a 16 
amino acid region near the carboxy end of pOHPP 
dioxygenases that shows 62.5% identity across all 
phyla. Ruetschi et al . , Eur. J. Biochem. 205:459-466, 
1992. This region is also present in the truncated 
Arabidopsis sequence. The lines of evidence suggest 
that the partial length Arabidopsis cDNA described 
above encodes a pOHPP dioxygenase, most likely the 
pdsl locus. 



205 :459-466, 



1992). Amino acid identity between 
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Isolation and Characterization of a full length 
Arabidops ispOHPP dioxygenase cDNA 

Utilizing the partial length Arabldopsls cDNA 
probe, an Arabidopsis cDNA library was screened by 
nucleic acid hybridization for full length cDNAs . A 
large number of hybridizing cDNAs were isolated, and 
one of the longest, pHPPl.5, containing a 1/52 0 bp 
insertion, was sequenced completely; ■ the insert is 
presented as SEQ ID NO: 1. pHPPl . 5 encodes a 446 
amino acid protein (presented as SEQ ID NO: 2) , which 
is slightly larger in size than mammalian and 
bacterial pOHPP dioxygenases . pHPPl . 5 shows 34-4 0% 
identity at the amino acid level to pOHPP dioxygenases 
from various mammals and bacteria. In comparing four 
bacterial pOHPP dioxygenases and one mammalian pOHPP 
dioxygenases (pig) which ranged in size from 346-404 
amino acids, Denoya et al / identified 69 amino acids 
that were conserved between all five pOHPP 
dioxygenases. Denoya et al . , J. Bacteriol. 176:5312- 
5319, 1994. The pHPPl . 5 coding region contains 52 of 
these 6 9 conserved amino acids. 

Demonstration that pHPPl.5 encodes an active pOHPP 
dioxygenase protein and complements the pdsl mutation 

In order to definitively demonstrate that pHPPl . 5 
is the gene product encoded by the pds2 locus and that 
it encodes a functional pOHPP dioxygenase protein, the 
pHPPl.5 cDNA was cloned into a plant transformation 
vector for molecular complementation experiments with 
the pdsl. The full length wild-type pOHPP dioxygenase 
cDNA will be subcloned into a plant transformation 
vector driven by the Cauliflower Mosaic Virus 3 5S 
(CaMV) promoter and containing all necessary 
termination cassettes and selectable markers (Kan r ) . 
The CaMV promoter is a strong constitutive promoter. 
This single construct and the vector without the 
pHPPl.5 insert (as a control) will be used in vacuum 
infiltration transformation which uses whole soil 
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grown plan^^^md will be done on plan^^ihat are 

heterozygous for the pdsl mutation. Bouchez et al., CR 
Acad. Sci. Paris, Sciences de la vie 316, 1993. 

In the standard procedure, 20-3 0 soil grown 
5 plants will be independently transformed and analyzed 
separately. In this case homozygous plants containing 
the pdsl mutation would be lethal while heterozygous 
plants containing the pdsl mutation would be 
segregating 2:1 for the pdsl mutation in their 
siliques. The inventors will use a similar number of 
wild type plants in a parallel transformation as a 
control. After transformation of the pdsl segregating 
plant population, as the plants are setting seed the 
inventors can easily identify those heterozygous for 
15 the pdsl mutation in retrospect by inspection of their 
siliques which would contain green : white embryos in a 
3 : 1 ratio . 

Seed harvested from individually transformed 
heterozygous pdsl plants will be germinated on 

20 kanamycin and resistant seedlings transferred to soil. 
Segregation analysis of seed from these primary 
transformants (T2 seed) and T3 seed for segregation of 
the pdsl phenotype (albino^ and phytoene accumulating )' ; 
and the T-DNA encoded kanamycin resistance marker 

25 (wild type pOHPP dioxygenase cDNA) will conclusively 

demonstrate complementation of the pdsl mutation with 
the pOHPP dioxygenase cDNA. To provide additional 
proof that pHPPl . 5 is encoded by the pdsl locus, the 
pHPPl.5 cDNA has been mapped relative to the pdsl 

3 0 locus using recombinant inbred lines, as described in 
Lister, et al . , Plant_sL_ 4 : 745-750 , 1993. The- pHPPl . 5 
cDNA mapped to the region of chromosome 1 containing 
the pdsl mutation (Figure 4). Finally, the pHPPl . 5 
cDNA will be overexpressed in E. coli and the activity 

35 of the protein determined. 
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Modification of pOHPP Expression 

From the genetic and biochemical studies 
described above it is clear that only one pOHPP 
dioxygenase gene product is involved in chloroplast ic 
5 quinone synthesis, that the pdsl mutation defines this 
gene, that the pHPPl . 5 cDNA is the product encoded by 
the pdsl locus and that disruption of its function 
completely eliminates Vitamin E production and 
plastoquinone and carotenoid synthesis in plant 
10 tissues. Modification of pOHPP dioxygenase expression 
in plants by molecular techniques using pHPPl . 5 can 
therefore be used to positively or negatively affect 
the production of tocopherols, plastoquinones directly 
and carotenoids indirectly (refer to Figures 1 and 2) . 
15 Specifically, overexpression of the pOHPP dioxygenase 
enzyme will result in increased levels of one or more 
of these compounds in the tissues of transgenic 
plants. Alternatively, using antisense techniques, it 
is possible to lower the level of enzyme activity to 
20 decrease the levels of these compounds in plants.. 

Additionally, overexpression of the pOHPP dioxygenase 
will enable a transgenic plant to withstand elevated 
levels of herbicides that target this enzyme, 
providing agrinomically significant herbicide 
25 resistance relative to normal plants. 

Two different plant systems, Arabidopsis and 
tomato, are being used to demonstrate the effects of 
modified pOHPP dioxygenase in plant tissues. 
Constitutive overexpression of pOHPP dioxygenase will 
30 be done in both plant systems utilizing the CaMV 35S 
promoter and the pHPP1.5 cDNA. The consequences of 
this altered expression on tocopherol, plastoquinone 
and carotenoid levels and profiles in various plant 
tissues will be determined as described below. In 
35 tomato, tissue specific overexpression of pOHPP 

dioxygenase (pHPPl.5) will be driven by the fruit 
specific promoter derived from the tomato /Jsubunit 
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gene, which is expressed specifically in developing, 
but not ripening tomato fruit. This will determine 
the potential for modifying the levels of tocopherol, 
plastoquinone and carotenoids specifically in 
developing and ripening fruit for nutritional purposes 
without affecting their production in other plant 
tissues. These combined experiments will determine 
whether pOHPP dioxygenase is a rate limiting step in 
chloroplastic homogentisic acid derived quinone 
synthesis and the potential for manipulating 
chloroplastic homogentisic acid derived quinones 
(tocopherols and plastoquinones ) and compounds that 
require quinones for their synthesis (carotenoids, 
etc) by increasing pOHPP dioxygenase activity. 

Multiple independent transf ormants will be 
produced for each construct and plant species used. 
The integration and gene copy number of each chimeric 
gene in each line will be confirmed by southern 
analysis, the level of pOHPP mRNA determined by 
Northern blot analysis, pOHPP dioxygenase activity 
determined .as described in Schulz et al . , FEBS 
318:162-166, 1993, and the effects on individual 
chloroplastic components of interest analyzed 
(tocopherols, plastoquinones and carotenoids) . In 
green tissue containing const itutively expressing 
constructs this analysis can occur relatively soon 
after transf ormants are put into soil. Analysis of 
fruit specific construct lines will require much more 
time for fruit set to occur. Analysis of tocopherols, 
plastoquinones and carotenoids will be by a 
combination of HPLC, optical and mass spectra as 
described in Norris et al. (1995, in press). Analysis 
of tocopherol levels is performed by HPLC and when 
needed by GCrmass spectroscopy in selected ion mode. 
In MS analysis the absolute level of tocopherol will 
be quantified by isotopic dilution with a known, 
"heavy carbon" tocopherol standard added at the start 
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of the extraction. Determination based on fresh 
weight of tissue can also be performed. Plastoquinone 
levels will be quantified by C8 HPLC and optical 
spectra as described in Norris et al . (1995, in 
press) . Total carotenoid levels are determined 
spectrophotometrically and the levels of individual 
carotenes quantified by C18 HPLC and optical spectra 
quantified to standards. In the course of these 
experiments we will identify high expressing lines 
with simple insertions that segregate as single 
genetic loci in progeny. This will facilitate 
analysis of the inheritance of the gene and phenotype 
in future generations . 

Qverexpression of pQHPP for in vitro Herbicide 
Analysis 

pOHPP dioxygenase will be overexpressed in E . 
coli or other prokaryotic or eukaryotic protein 
production systems and purified in large amounts for 
use in enzymatic assays for identifying new herbicide 
compounds (pOHPP inhibitors) and optimizing existing 
chemistries through detailed kinetic analysis. 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CCACGCGTCC GAGTTTTAGC AGAGTTGGTG AAATCA ATG GGC CAC CAA AAC GCC 54 

Met Gly His Gin Asn Ala 
1 5 

5 GCC GTT TCA GAG AAT CAA AAC CAT GAT GAC GGC GCT GCG TCG TCG CCG 102 

Ala Val Ser Glu Asn Gin Asn His Asp Asp Gly Ala Ala Ser Ser Pro 
!0 15 20 

GGA TTC AAG CTC GTC GGA TTT TCC AAG TTC GTA AGA AAG AAT CCA AAG 150 
Gly Phe Lys Leu Val Gly Phe Ser Lys Phe Val Arg Lys Asn Pro Lys 
10 25 30 35 

TCT GAT AAA TTC AAG GTT AAG CGC TTC CAT CAC ATC GAG TTC TGG TGC 198 
Ser Asp Lys Phe Lys Val Lys Arg Phe His His lie Glu Phe Trp Cys 
40 45 50 

GGC GAC GCA ACC AAC GTC GCT CGT CGC TTC TCC TGG GGT CTG GGG ATG 24 6 
15 Gly Asp Ala Thr Asn Val Ala Arg Arg Phe Ser Trp Gly Leu Gly Met 

55 60 65 70 

AGA TTC TCC GCC AAA TCC GAT CTT TCC ACC GGA AAC ATG GTT CAC GCC 2 94 
Arg Phe Ser Ala Lys Ser Asp Leu Ser Thr Gly Asn Met Val His Ala 
75 80 85 

2 0 TCT TAC CTA CTC ACC TCC GGT GAC CTC CGA TTC CTT TTC ACT GCT CCT 342 

Ser Tyr Leu Leu Thr Ser Gly Asp Leu Arg Phe Leu Phe Thr Ala Pro 
90 95 100 

TAC TCT CCG TCT CTC TCC GCC GGA GAG ATT AAA CCG ACA ACC ACA GCT 3 90 
Tyr Ser Pro Ser Leu Ser Ala Gly Glu lie Lys Pro Thr Thr Thr Ala 
25 105 110 115 

TCT ATC CCA AGT TTC GAT CAC GGC TCT TGT CGT TCC TTC TTC TCT TCA 43 8 
Ser He Pro Ser Phe Asp His Gly Ser Cys Arg Ser Phe Phe Ser Ser 
120 125 130 

CAT GGT CTC GGT GTT AGA GCC GTT GCG ATT GAA GTA GAA GAC GCA GAG 4 86 

3 0 His Gly Leu Gly Val Arg Ala Val Ala He Glu Val Glu Asp Ala Glu 

135 140 145 150 

TCA GCT TTC TCC ATC AGT GTA GCT AAT GGC GCT ATT CCT TCG TCG CCT 5 34 
Ser Ala Phe Ser lie Ser Val Ala Asn Gly Ala He Pro Ser Ser Pro 
155 160 165 

3 5 CCT ATC GTC CTC AAT GAA GCA GTT ACG ATC GCT GAG GTT AAA CTA TAC 582 

Pro He Val Leu Asn Glu Ala Val Thr He Ala Glu Val Lys Leu Tyr 
170 175 180 

GGC GAT GTT GTT CTC CGA TAT GTT AGT TAC AAA GCA GAA GAT ACC GAA 630 
Gly Asp Val Val Leu Arg Tyr Val Ser Tyr Lys Ala Glu Asp Thr Glu 
40 185 190 195 

AAA TCC GAA TTC TTG CCA GGG TTC GAG CGT GTA GAG GAT GCG TCG TCG 678 
Lys Ser Glu Phe Leu Pro Gly Phe Glu Arg Val Glu Asp Ala Ser Ser 
200 205 210 

TTC CCA TTG GAT TAT GGT ATC CGG CGG CTT GAC CAC GCC GTG GGA AAC 726 

4 5 Phe Pro Leu Asp Tyr Gly He Arg Arg Leu Asp His Ala Val Gly Asn 

215 220 225 230 

GTT CCT GAG CTT GGT CCG GCT TTA ACT TAT GTA GCG GGG TTC ACT GGT 7 74 
Val Pro Glu Leu Gly Pro Ala Leu Thr Tyr Val Ala Gly Phe Thr Gly 
235 240 245 
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TTT CAC CAA TTC GCA GAG TTC ACA GCA GAC GAC GTT GGA ACC GCC GAG 82 2 
Phe His Gin Phe Ala Glu Phe Thr Ala Asp Asp Val Gly Thr Ala Glu 
250 255 260 

AGC GGT TTA AAT TCA GCG GTC CTG GCT AGC AAT GAT GAA ATG GTT CTT 8 70 
5 Ser Gly Leu Asn Ser Ala Val Leu Ala Ser Asn Asp Glu Met Val Leu 

265 270 275 

CTA CCG ATT AAC GAG CCA GTG CAC GGA ACA AAG AGG AAG AGT CAG ATT 918 
Leu Pro He Asn Glu Pro Val His Gly Thr Lys Arg Lys Ser Gin He 
280 285 290 

10 CAG ACG TAT TTG GAA CAT AAC GAA GGC GCA GGG CTA CAA CAT CTG GCT 96 6 

Gin Thr Tyr Leu Glu His Asn Glu Gly Ala Gly Leu Gin His Leu Ala 
295 300 305 310 

CTG ATG AGT GAA GAC ATA TTC AGG ACC CTG AGA GAG ATG AGG AAG AGG 1014 
Leu Met Ser Glu Asp He Phe Arg Thr Leu Arg Glu Met Arg Lys Arg 
15 315 320 325 

AGC AGT ATT GGA GGA TTC GAC TTC ATG CCT TCT CCT CCG CCT ACT TAC 1062 
Ser Ser He Gly Gly Phe Asp Phe Met Pro Ser Pro Pro Pro Thr Tyr 
330 335 340 

TAC CAG AAT CTC AAG AAA CGG GTC GGC GAC GTG CTC AGC GAT GAT CAG 1110 
Tyr Gin Asn Leu Lys Lys Arg Val Gly Asp Val Leu Ser Asp Asp Gin 
345 350 355 

ATC AAG GAG TGT GAG GAA TTA GGG ATT CTT GTA GAC AGA GAT GAT CAA 1158 
He Lys Glu Cys Glu Glu Leu Gly He Leu Val Asp Arg Asp Asp Gin 
360 365 370 

25 GGG ACG TTG CTT CAA ATC TTC ACA AAA CCA CTA GGT GAC AGG CCG ACG 12 06 

Gly Thr Leu Leu Gin He Phe Thr Lys Pro Leu Gly Asp Arg Pro Thr 
375 380 385 390 

ATA TTT ATA GAG ATA ATC CAG AGA GTA GGA TGC ATG ATG AAA GAT GAG 12 54 
He Phe He Glu He He Gin Arg Val Gly Cys Met Met Lys Asp Glu 
30 395 400 405 

GAA GGG AAG GCT TAC CAG AGT GGA GGA TGT GGT GGT TTT GGC AAA GGC 13 02 
Glu Gly Lys Ala Tyr Gin Ser Gly Gly Cys Gly Gly Phe Gly Lys Gly 
410 415 420 

AAT TTC TCT GAG CTC TTC AAG TCC ATT GAA GAA TAC GAA AAG ACT CTT 13 50 
Jo Asn Phe Ser Glu Leu Phe Lys Ser He Glu Glu Tyr Glu Lys Thr Leu 

425 430 435 

GAA GCC AAA CAG TTA GTG GGA TGA ACAAGAAGAA G AAC C AAC T A AAGGATTGTG 14 04 
Glu Ala Lys Gin Leu Val Gly * 
440 445 

4 0 TAATTAATGT AAAACTGTTT TAT CTT ATCA AAACAATGTT ATACAACATC TCATTTAAAA 14 64 

ACGAGATCAA TCAAAAAATA CAATCTTAAA TTCAAAACCA AAAAAAAAAA AAAAA 1519 

(2) INFORMATION FOR SEQ ID NO : 2 ; 

<i) SEQUENCE CHARACTERISTICS: 
. _ (A) LENGTH: 446 amino acids 

45 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Gly His Gin Asn Ala Ala Val Ser Glu Asn Gin Asn His Asp Asp 
1 5 10 15 

Gly Ala Ala Ser Ser Pro Gly Phe Lys Leu Val Gly Phe Ser Lys Phe 
5 20 25 30 

Val Arg Lys Asn Pro Lys Ser Asp Lys Phe Lys Val Lys Arg Phe His 
35 40 45 

His He Glu Phe Trp Cys Gly Asp Ala Thr Asn Val Ala Arg Arg Phe 
50 55 60 

10 Ser Trp Gly Leu Gly Met Arg Phe Ser Ala Lys Ser Asp Leu Ser Thr 

65 70 .75 80 

Gly Asn Met Val His Ala Ser Tyr Leu Leu Thr Ser Gly Asp Leu Arg 
65 90 95 

Phe Leu Phe Thr Ala Pro Tyr Ser Pro Ser Leu Ser Ala Gly Glu He 
15 100 105 110 

Lys Pro Thr Thr Thr Ala Ser He Pro Ser Phe Asp His Gly Ser Cys 
115 120 125 

Arg Ser Phe Phe Ser Ser His Gly Leu Gly Val Arg Ala Val Ala He 
130 135 140 

20 Glu Val Glu Asp Ala Glu Ser Ala Phe Ser He Ser Val Ala Asn Gly 

145 150 155 160 

Ala He Pro Ser Ser Pro Pro He Val Leu Asn Glu Ala Val Thr He 
165 170 175 

Ala Glu Val Lys Leu Tyr Gly Asp Val Val Leu Arg Tyr Val Ser Tyr 
25 180 185 190 

Lys Ala Glu Asp Thr Glu Lys Ser Glu Phe Leu Pro Gly Phe Glu Arg 
195 200 205 

Val Glu Asp Ala Ser Ser Phe Pro Leu Asp Tyr Gly He Arg Arg Leu 
210 215 220 

3 0 Asp His Ala Val Gly Asn Val Pro Glu Leu Gly Pro Ala Leu Thr Tyr 

225 230 235 240 

Val Ala Gly Phe Thr Gly Phe His Gin Phe Ala Glu Phe Thr Ala Asp 
245 250 255 

Asp Val Gly Thr Ala Glu Ser Gly Leu Asn Ser Ala Val Leu Ala Ser 
35 260 265 270 

Asn Asp Glu Met Val Leu Leu Pro He Asn Glu Pro Val His Gly Thr 
275 280 285 

Lys Arg Lys Ser Gin He Gin Thr Tyr Leu Glu His Asn Glu Gly Ala 
290 295 300 

4 0 Gly Leu Gin His Leu Ala Leu Met Ser Glu Asp He Phe Arg Thr Leu 

305 310 315. 320 

Arg Glu Met Arg Lys Arg Ser Ser He Gly Gly Phe Asp Phe Met Pro 
325 330 335 

Ser Pro Pro Pro Thr Tyr Tyr Gin Asn Leu Lys Lys Arg Val Gly Asp 
45 340 345 350 
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Val Leu Ser Asp Asp Gin He Lys Glu Cys Glu Glu Leu Gly lie Leu 
355 360 365 

Val Asp Arg Asp Asp Gin Gly Thr Leu Leu Gin He Phe Thr Lys Pro 
370 375 380 

5 Leu Gly Asp Arg Pro Thr He Phe He Glu He He Gin Arg Val Gly 

385 390 395 400 

Cys Met Met Lys Asp Glu Glu Gly Lys Ala Tyr Gin Ser Gly Gly Cys 
405 410 415 

Gly Gly Phe Gly Lys Gly Asn Phe Ser Glu Leu Phe Lys Ser He Glu 
10 420 425 430 

Glu Tyr Glu Lys Thr Leu Glu Ala Lys Gin Leu Val Gly * 
435 440 445 
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5 CLAIMS 

I claim: 



10 



1. A biologically pure sample of DNA, the DNA 
comprising a DNA sequence coding for the expression of 
a plant p-hydroxyphenyl pyruvic acid dioxygenase . 

2 . A vector containing the DNA sequence of 
claim 1. 

3 . A microbial host transformed by the vector 
of claim 2 . 



4. The; DNA of claim 1, wherein the p- 

15 hydroxyphenyl pyruvic acid dioxygenase is from 
Arabidopsis thaliana. 

5. A transgenic tomato plant transformed with a 
DNA construct including the DNA of claim 1 . 

6. A transgenic Arabidopsis plant transformed 
20 with a DNA construct including the DNA of claim 1 

7. The biologically pure DNA of claim 1 wherein 
the DNA is SEQ ID NO: 2. 

8. A DNA plant gene expression construct 
comprising: 

25 a. a DNA sequence coding for the 

expression of a plant p-hydroxyphenyl pyruvic acid 

dioxygenase ; 

b. a promoter effective in plant cells 

located 5' to the DNA coding sequence; and 

30 c. a .3' termination sequence effective in 

plant cells . 
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A DNA construct conriDrisina: 



9. A DNA construct comprising: 

a. a promoter capable of expressing a 
downstream coding sequence in a tomato plant; 

b. a DNA sequence coding for the 
5 expression of a p-hydroxyphenyl pyruvic acid 

dioxygenase of plant origin; and 

c a 3' termination sequence, the 
construction capable of expressing a p-hydroxyphenyl 
pyruvic acid dioxygenase gene when transformed into 
10 tomato plants. 



10. A bacteria containing the construction of 
Claim 9 . 




11. A tomato plant cell containing the 
construction of Claim 9. 

12. An Arabidopsis plant cell containing the 
construction of Claim 9. 



13 . A transgenic tomato plant comprising in its 
genome a foreign genetic construction comprising, 5' 
to 3', a promoter effective in tomato, a DNA coding 
20 region encoding p-hydroxyphenyl pyruvic acid 

dioxygenase, and a transcriptional terminator, the 
genetic construction effective in vivo in tomato 
plants to stimulate expression of p-hydroxyphenyl 
pyruvic acid dioxygenase. 



25 14. Seed of the tomato plant of claim 13. 

15. Fruit of the tomato plant of claim 13. 

16 . The transgenic tomato plant of claim 11 
wherein the DNA coding region encoding p-hydroxyphenyl 
pyruvic acid dioxygenase is that set forth in SEQ ID 

30 NO:l. 
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17. A method of suppressing the production of 
vitamin E and plastoquinones in a plant, the method 
comprising the steps of: 

a. isolating a DNA sequence encoding a p- 
5 hydroxyphenyl pyruvic acid dioxygenase of plant 

origin; 

b. creating a genetic construction including, 

5' to 3', a promoter effective in the plant's cells, a 
coding sequence, and a transcriptional terminator, the 
10 coding region being derived from the DMA sequence, 
wherein the DNA sequence from step (a) has been 
altered so that expression of p- hydroxyphenyl pyruvic 
acid dioxygenase is suppressed; and 

c. transforming a cell of the plant with the 
15 genetic construction, whereby the plant's cell 

produces lowered levels of vitamin E and 
plastoquinones. 
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